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1 Introduction 



Metamaterials are beginning to transform optics and microwave technology thanks 
to their versatile properties that, in many cases, can be tailored according to prac- 
tical needs and desires (201 ED El SHI EZl EHl EEl USD]. Although metamaterials 
are surely not the answer to all engineering problems, they have inspired a series of 
significant technological developments and also some imaginative research, because 
they invite researchers and inventors to dream. Imagine there were no practical 
limits on the electromagnetic properties of materials. What is possible? And what 
is not? If there are no practical limits, what are the fundamental limits? Such 
questions inspire taking a fresh look at the foundations of optics [llj and at con- 
nections between optics and other areas of physics. In this article we discuss such 
a connection, the relationship between optics and general relativity, or, expressed 
more precisely, between geometrical ideas normally applied in general relativity and 
the propagation of light, or electromagnetic waves in general, in materials 
Farfetched as it may appear, general relativity turns out [71] to have been put to 
practical use in the first working prototype of an electromagnetic cloaking device 
|122] , it gives perhaps the most elegant approach to achieving invisibility (721 1104] , 
and [71] , general relativity even works behind the scenes of perfect lenses |10H 1143] . 

The practical use of general relativity in electrical and optical engineering may 
seem surprisingly unorthodox: traditionally, relativity has been associated with the 
physics of gravitation [27] and cosmology [99] or, in engineering |142] . has been 
considered a complication, not a simplification. For example, the Global Positioning 
System would not be as accurate as it is without taking relativistic corrections into 
account that are due to gravity and the motion of the navigation satellites. However, 
here we are not concerned with the influence of the natural geometry of space and 
time on optics, the space-time curvature due to gravity, but rather we show how 
optical materials create artificial geometries for light and how such geometries can 
be exploited in designing novel optical devices. 

Connections between geometry and optics are nothing new; the ideas we ex- 
plain here are rather, to borrow a phrase of Sir Michael Berry, "new things in old 
things". These ideas are based on Fermat's principle [H] formulated in 1662 by 
Pierre de Fermat, but anticipated nearly a millennium ago by the Arab scientist Ibn 
al-Haytham and inspired by the Greek polymath Hero of Alexandria's reflections on 
light almost two millennia ago. According to Fermat's principle, light rays follow 
extremal optical paths in materials (shortest or longest, mostly shortest) where the 
length measure is given by the refractive index. Media change the measure of length. 
This means that any optical medium establishes a geometry [121 1119] 1121] : the glass 
in a lens, the water in a river or the air creating a mirage in the desert. General 
relativity has cultivated the theoretical tools for fields in curved geometries [611 197] . 
In this article we show how to use these tools for applications in electromagnetic or 
optical metamaterials. 

Metamaterials are materials with electromagnetic properties that originate from 
man-made sub- wavelength structures [80 11861 1120] . Perhaps the best known metama- 
terials are the materials used in the pioneering demonstrations of negative refraction 
[127] or invisibility cloaking |122j of microwaves, see Fig. fll or for negative refraction 
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Figure 1: Cloaking device. (From Ref. |122] . Reprinted with permission from AAAS.) 
Two-dimensional microwave cloaking structure (background image) with a plot of the ma- 
terial parameters that are implemented. The cloaking device is made of circuit-board with 
structures that are about an order of magnitude smaller than the wavelength. The struc- 
tures are split-ring resonators with tunable magnetic response. The split-ring resonators 
of the inner and outer rings are shown in expanded schematic form (transparent square 
insets). 



of near- visible light [129j . These materials consist of metallic cells that are smaller 
than the relevant electromagnetic wavelength. Each cell acts like an artificial atom 
that can be tuned by changing the shape and the dimensions of the metallic struc- 
ture. It is probably fair to regard microstructured or photonic-crystal fibres |118j as 
metamaterials as well, see Fig. |2j Here sub-wavelength structures — airholes along 
the fibre — significantly influence the optical properties of the fused silica the fibres 
are made of. Metamaterials have a long history: the ancient Romans invented ruby 
glass, which is a metamaterial, although the Romans presumably did not know this 
concept. Ruby glass |148] contains nano-scale gold colloids that render the glass 
neither golden nor transparent, but ruby, depending on the size and concentration 
of the gold droplets. The color originates from a resonance of the surface plasmons 
[7] on the metallic droplets. Metamaterials per se are nothing new: what is new is 
the degree of control over the structures in the material that generate the desired 
properties. 

The specific starting point of our theory is not new either. In the early 1920's 
Gordon [40j noticed that moving isotropic media appear to electromagnetic fields as 
certain effective space-time geometries. Bortolotti [12] and Rytov |119j pointed out 
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Figure 2: Photonic-crystal fibres. (From Ref. |118] . Reprinted with permission from 
AAAS.) An assortment of optical (OM) and scanning electron (SEM) micrographs of 
photonic-crystal fibre (PCF) structures. (A) SEM of an endlessly single-mode solid core 
PCF. (B) Far-field optical pattern produced by (A) when excited by red and green laser 
Ught. (C) SEM of a recent birefringent PCF. (D) SEM of a smah (800 nm) core PCF 
with ultrahigh nonlinearity and a zero chromatic dispersion at 560-nm wavelength. (E) 
SEM of the first photonic band gap PCF, its core formed by an additional air hole in a 
graphite lattice of air holes. (F) Near-field OM of the six-leaved blue mode that appears 
when (E) is excited by white light. (G) SEM of a hollow-core photonic band gap fiber. 
(H) Near-field OM of a red mode in hollow-core PCF (white light is launched into the 
core). (I) OM of a hollow-core PCF with a Kagome cladding lattice, guiding white light. 



that ordinary isotropic media establish non-Eudidean geometries for hght. Tamm 
|134t 1135] generahzed the geometric approach to anisotropic media and briefly ap- 
phed this theory |135j to the propagation of hght in curved geometries. In 1960 
Plebanski |112] formulated the electromagnetic effect of curved space-time or curved 
coordinates in concise constitutive equations. Electromagnetic fields perceive media 
as geometries and geometries act as effective media. Furthermore, in 2000 it was un- 
derstood [67] that media perceive electromagnetic fields as geometries as well. Light 
acts on dielectric media via dipole forces (forces that have been applied in optical 
trapping and tweezing [221 El])- These forces turn out to appear like the inertial 
forces in a specific space-time geometry. This geometric approach [71J was used to 
shed light on the Abraham-Minkowski controversy about the electromagnetic mo- 
mentum in media |2l [75l [901 HOO] . Geometrical ideas have been applied to construct 
conductivities that are undetectable by static electric fields [l3l[ll], which was the 
precursor of invisibility devices [5], [38], [72], [73], [87], I104[ 1123] based on implementations 
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of coordinate transformations. From these recent developments grew the subject of 
transformation optics. Here media, possibly made of metamaterials, are designed 
such that they appear to perform a coordinate transformation from physical space 
to some virtual electromagnetic space. As we describe in this article, the concept 
of transformation optics embraces some of the spectacular recent applications of 
metamaterials. 

Transformation optics is beginning to transform optics. We would do injustice 
to this emerging field if we attempted to record every recent result. By the time 
this article goes to press, it would be outdated already. Instead we focus on the "old 
things in new things" , because those are the ones that are guaranteed to last and to 
remain inspiring for a long time to come. This article rather is a primer, not a typical 
literature review. We try to give an introduction into connections between geometry 
and electromagnetism in media that is as consistent and elementary as possible, 
without assuming much prior knowledge. We begin in §2 with a brief section on 
Fermat's principle and the concept of transformation optics. In §3 we develop in 
detail the mathematical machinery of geometry. Although this is textbook material, 
many readers will appreciate a (hopefully) readable introduction. We do not assume 
any prior knowledge of differential geometry; readers familiar with this subject may 
skim through most of §3. After having honed the mathematical tools, we apply them 
to Maxwell's electromagnetism in §4 where we develop the concept of transformation 
optics. In §5 we discuss some examples of transformation media: perfect invisibility 
devices, perfect lenses, the Aharonov-Bohm effect in moving media and analogues 
of the event horizon. Let's begin at the beginning, Fermat's principle. 

2 Fermat's principle 

In a letter dated January 1st, 1662, Pierre de Fermat formulated a physical principle 
that was destined to shape geometrical optics, to give rise to Lagrangian and Hamil- 
tonian dynamics and to inspire Schrodinger's quantum mechanics and Feynman's 
form of quantum field theory and statistical mechanics. Fermat's principle is the 
principle of the shortest optical path: light rays passing between two spatial points 
A and B chose the optically shortest path, see Fig. |3| In some cases, however, light 
takes the longest path; in any case, light rays follow extremal optical paths, see Fig. 
|4j The optical path length s is defined in terms of the refractive index n as 



in Cartesian coordinates. If the refractive index varies in space — for non-uniform 
media — the shortest optical path is not a straight line, but is curved. This bending 
of light is the cause of many optical illusions. For example, picture a mirage in the 
desert [35]. The tremulous air above the hot sand conjures up images of water in 
the distance, but it would be foolish to follow these deceptions; they are not water, 
but images of the sky. The hot air above the sand bends light from the sky, because 
hot air is thin with low refractive index and so light prefers to propagate there. 




(2.1) 
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Figure 3: Fermat's principle. Light takes the shortest optical path from A to B (solid 
line) which is not a straight line (dotted line) in general. The optical path length is 
measured in terms of the refractive index n integrated along the trajectory. The greylevel 
of the background indicates the refractive index. The figure illustrates the ray trajectories 
involved in forming a mirage. 



Fermat's principle has profoundly influenced modern physics, and like most if 
not all profound discoveries it has deep roots in the history of science. Fermat was 
inspired by the Greek polymath Hero of Alexandria's theory of light reflection in 
mirrors. The Arab scientist Ibn al-Haytham anticipated Fermat's principle in his 
Book of Optics (written under house arrest in Cairo from 1011 to 1021). Fermat's 
principle was instantly greeted with objections, because it appears to violate causal- 
ity — it presumes an idea of destiny. The principle governs the path between A 
and B if it is known that light travels from A to B; Fermat's principle shows how 
the ray's destiny is fulfilled, but it does not explain why the light ray arrives at 
B and not at some other end point. Wave optics resolves this problem, because a 
wave emitted at A propagates in all directions (but possibly with greatly varying 
amplitude). The path of extremal optical length ( |2.1 ) is the place of constructive 
interference between all possible paths. 

We will derive Fermat's principle later, in Sec. 4.3. The question we pose here 
is this: imagine one illuminates a non-uniform medium with a grid of light rays, see 
Fig.|5} Each ray is curved according to Fermat's principle. Is it possible to transform 
away the curvature of the grid? In this case the curved path of each ray would appear 
as a straight line in some transformed space. So, in other words, is it possible that 
the bending of light is an illusion of choosing the wrong coordinates? Curvature 
would be an illusion of Cartesian linear thinking. Such transformable media may 
create optical illusions by themselves, in fact they may create the ultimate illusion, 
invisibility: if the transformable grid contains a hole, anything inside the hole is 
invisible. The transformation medium acts as a cloaking device. 

For simplicity, imagine a two-dimensional situation where the refractive index 
varies in x and y, and light is confined to the x, y plane. Consider a coordinate 
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Figure 4: Fermat's principle: longest optical path. The figure shows a simple example 
where light takes the longest optical path: light beyond the focal point of a lens, traveling 
along a straight line from A to B. To see why the optical path from A to B is the longest, 
compare the solid line of the actual path with an example of a virtual path, with the 
dashed and dotted paths, and note that the optical path length between the focal points 
A and A' of the lens is always the same, regardless whether light travels along a straight 
line or is refracted in the lens. Therefore the optical path length taken differs from the 
virtual path length by the difference between the two short sides and the long side of the 
triangle from L to A' and B. The sum of the two short sides of a triangle is always longer 
than the long side: light has taken the longest optical path. 



transformation from x and y to x' and y'. Is Fermat's principle obeyed in transformed 
coordinates, 



The reader easily sees from 

dx' dx' ^y' A > A 

dx = —dx+ —dy , dy = — dx + — dy 
ox ay ox ay 



(2.2) 



(2.3) 



(2.4) 



that dx'^ + dy'^ is proportional to dx^ + dy"^ if 

dx' dy' dy' dx' 
dx dy ' dx dy 

In this case the coordinate transformation changes (dx^+dy^) into n''^ (dx'^+dy'^ 
thus preserving the form (2.1) of Fermat's principle. One obtains 

2 



dx' 



dy' 



A transformation with the property (2.4) is known as a conformal transformation 
in two-dimensional space [HS]. Conformal transformations leave Fermat's principle 



Figure 5: Optical conformal mapping ^72j. Suppose that an optical medium performs 
a coordinate transformation from a straight Cartesian grid (left) to curved coordinates in 
physical space (right). The trajectories of light rays follow the curved coordinates, but this 
apparent curvature is an illusion and may be used to create optical illusions, for example 
invisibility. 



(2.1) intact, they correspond to materials with an isotropic refractive index profile. 
If n' = 1, the transformed space is empty; light would propagate along a straight 
line there: the refractive-index profile acts as a transformation medium. 



So far we have discussed light rays. How does the conformal transformation (2.4) 
act on light waves? Suppose that both amplitudes ip of the optical polarization 
satisfy the Helmholtz equation 

V2 + — riM ^ = (2.6) 

where u denotes the frequency and c the speed of light in vacuum. It is convenient 
to write the Laplacian as 

^2_^ ^_(_^ (2 7) 

g^2 Qy2 \^dx dy J \dx dy J ' 



We obtain from the differential equations (2.4) of the conformal map the transfor- 
mation 

d ■ \ ( "9 dy' d . dx' d . dy' d 
dx dy J \ dx dx' dx dy' dy dx' dy dy' 

dx' dy'\ ( d d 



dx dx ) \dx' dy' 
dy dy ) \dx' dy' 
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and, by exchanging i with — i, 



d 



+ i 



d 



dx' . dy' 



\dx dy J \ dx 
Furthermore, since 



dx 



d 



+ i 



d 



dx' dy' 



dy' . dx' 



+ 1 



dy dy J \dx' dy' 



d 



+ i 



d 



d ■ ^ \ ( ■ '^y'\ "9^^' '^^y' ■ 9^y' ■ 

dx dy J \ dx dx J dx"^ dxdy dx"^ dxdy 



(2.9) 
(2.10) 



the Laplacian is transformed into 



dx' . dy'\ / dx' . dy' 
dx dx J \ dx dy 



d 



d 



d_ . d_ 

dx' dy' J \dx' dy' 



\dx J \dx 
\dy J \dy 



(2.11) 



Consequently, the Helmholtz equation (2.6) is invariant under conformal transfer 



mations if the refractive index is transformed according to Eq. (2.5). Waves are 
transformed in precisely the same way as rays. 



There is an elegant short-cut to the theory of this optical conformal mapping 
that allows us to condense the previous calculations in a few lines: complex analysis 
[H, 122] • Suppose we denote the two-dimensional coordinates by complex numbers 



z = x + iy, z* = x — iy. 



From 



we obtain 



and hence 



d dz d dz d 
+ 



d d 
+ 



dx dx dz dx dz* dz dz* 

d\ d I 



d_ 
dy 



d 



d 



dz dz* 



d__l fd_ 
dz 2 \dx 



_ _ A • A 

dy J ' dz* 2 \dx dy 



d^ 



dzdz* 



(2.12) 
(2.13) 
(2.14) 
(2.15) 



Consider a coordinate transformation described by a function w{z) that depends on 
z, but not on z*, 

dw 

x' + iy' = w{z) with ^ = 0. (2.16) 



Since 



dz* 



^dw f^j^-^\(ij^-i\ ^y' _|_ • _|_ • '^y' 

dz* \dx dy J dx dy dy dx 



(2.17) 
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the differential equation (2.4) of conformal maps are naturally satisfied; they are the 
Cauchy-Riemann differential equations of analytic functions [H [221 123] ■ Finally we 
obtain from 



dw dw* 



dzdz* 



dz dz* dwdw* 



dw 



dz 



(2.18) 



the relationship (2.5) between the original and the transformed refractive-index pro- 



file in the Helmholtz equation (2.6) as 



n 



dw 



dz 



n . 



(2.19) 



Complex analysis not only simplifies the theory, it provides optical conformal map- 
ping [72] with a vast resource in calculational tools and geometrical insights [U l92| 
[93]. 

Conformal coordinate transformations represent a special case; most spatial 
transformations are non-conformal, and we could also envision transformations that 
mix space and time. Consequently, media that implement such transformations are 



not subject to Fermat's principle in the form (2.1). Furthermore, the Helmholtz 
equation (2.6) is only approximately valid [llj. Light should be described as an 



electromagnetic wave subject to Maxwell's equations. There are various ways of de- 
veloping the concepts of transformation media for the general case. In §4 we discuss 
a theory that perhaps plays a similar role to that played by complex analysis in op- 
tical conformal mapping. It will equip the reader with calculational short-cuts and 
geometrical insights. For this theory we borrow concepts from general relativity, but 
we do not assume that the reader is familiar with them. The necessary ingredients 
from differential geometry are derived by elementary means in the following section. 



3 Arbitrary coordinates 

The theory of transformation media requires consideration of Maxwell's equations 
in arbitrary coordinates. This means that the natural mathematical language of 
transformation media is differential geometry, the mathematics that also describes 
curved spaces and Einstein's general relativity. Here we introduce the reader to the 
mathematics of arbitrary coordinates, to the extent necessary to deal with coordi- 
nate transformations of Maxwell's equations. The reader will see that this formalism 
provides the most transparent way of describing non-Cartesian coordinate systems: 
even something as apparently familiar as electromagnetism in spherical polar coor- 
dinates is much simpler in the language of differential geometry than in the standard 
treatment found in the textbooks |3 

3.1 Coordinate transformations 

We deal first with spatial coordinates; the extension to space-time coordinates will 
then be straightforward. Our interest is in writing equations that are valid in an 

^The introduction to curved coordinates given here is broadly similar to that in Schutz's excel- 
lent text [US]. 
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arbitrary spatial coordinate system {x\ i = 1,2,3} and in performing an arbitrary 
transformation to another set of coordinates that we distinguish from the original by 
a prime on the index: {x* , i' = 1, 2, 3}. Throughout, we take as concrete examples 
Cartesian coordinates {x,y,z} and spherical polar coordinates {r,6,(j)}, related by 



{x'} = {x, y, z}, {x''} = {r, 6, 0}, 
X = r sin 6 cos 0, r = a/x^ + + 2;^, 



y = r sin 6 sin 0, 6 = tan ^ (a/x^ + y'^/ z) , 
2; = rcos6' = tan~^(?//z). 



(3.1) 



At this point we introduce the Einstein summation convention in which a summation 
is implied over repeated indices; for example 

A'Bi = ^ A'Bi = A^Bi + A'^B2 + A^B^. (3.2) 

i 

This convention allows us to dispense with writing summation signs, which are 
completely unnecessary. For reasons that will become clear later on, our summations 
will generally be over a pair of indices in which one index is a subscript and one is 
a superscript, as in Eq. (3.2). We also introduce the Einstein range convention by 
which a free index {i.e. an index that is not summed over) is understood to range 
over all possible values of the index, for example 

A' = {A\ i = 1,2,3}, (3.3) 

Together, the summation and range conventions allow an economy of notation such 
as the following: 

^^.fc^jE^V'^' J,k = 1,2,3^. (3.4) 

The differentials of our two sets of coordinates, x* and x*', are related by the chain 
rule: 

dx^ = ^dx^ dx^' = — dx\ (3.5) 
with similar relations holding for the differential operators: 

d (9x* d d 9x*' d 

(3.6) 



dx^' (9x*' (9x*' 9x* (9x* 9x*' 



We denote the transformation matrices in Eqs. (3.5) and (3.6) by 

dx^ . i' <9x*' , 
A\, = — , A\ = — . 3.7 

dx' ' dx' ^ ^ 

Note that primes or unprimed indices in A*^, and A*!, do not mean that we simply 
use different indices: A* ■, and A*,- are different matrices where we differentiate with 
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respect to different sets of coordinates. Tlie reader may verify tliat for the example 
(3.1) the transformation matrices (3.7) are 



sin 9 cos (f) r cos 6 cos (p sin 6 sin > 
sin 9 sin 6 r cos 9 sin 6 r sin 9 cos d 



cos 9 



-r sin ^ 







(3.8) 



A' 



/ X 
r 

y 



xz 



\ 



z 
r 



y/x^ + y"^ 



X 



( sin ^ cos <\) 

- cos COS ( 



sin 9 sin i 



1 



CSC 9 sin ( 



X 

r 

xz 



y 



cos 9 sin < 
CSC 6^ cos ( 

y_ 

r 

yz 



\ + ?/2 + y'^ 



cos 



— sin 9 

r 



z 

r 

y/x^ + 




(3.9) 



(3.10) 



(3.11) 



From Eqs. (3.5) and (3.7) we find dx* = A*j,dx* = A*j,A'^dx-^ and dx' = A^dx* 
A^'jA^^vdx-'', which imply 



6' J, A\A'j, = 



(3.12) 



where 5*^- and 5*'-, are the Kronecker delta, the matrix elements of the unity matrix. 
Equations (3.12) state that the matrices A*-, and are the inverses of each other. 
This property can be deduced directly from the definitions (3.7) and the chain rule. 
The reader may verify the relations (3.12) for the example (3.8 )-(3.11 ). 



3.2 The metric tensor 

Although we have awarded ourselves the freedom of covering space with any coordi- 
nate system we wish, the distances between points in space are invariant — they are 
the same no matter which coordinates we use to calculate them. The basic quantity 
is the square of the infinitesimal distance ds between the points x* and x* + dx*. For 
Cartesian coordinates x* = {x, y, z} this is given by the 3-dimensional Pythagoras 
theorem: 

ds^ = dx^ + d|/2 + dz^ = 5ijdx'dx\ (3.13) 
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where 5ij is again the Kronecker delta. For general coordinates x*, the square of the 
line element ds^ is given by an expression quadratic in the coordinate differentials 
dx*: 

ds^ = Qijdx'dx^. (3.14) 



In Eq. (3.14) we have introduced the metric tensor gij, the quantity that allows us 
to calculate distances in space. The metric tensor is always symmetric in its indices, 



9ij 9jiy 



(3.15) 



for the following reason: a matrix can always be written as a sum of its symmetric 
and antisymmetric parts, and the reader can verify that an antisymmetric part of 
gij would not contribute to the distance (3.14). So it only makes sense to consider 
a symmetric metric tensor. In Cartesian coordinates (3.13) the metric tensor is the 
Euclidean metric 6ij. 

We can write the relation (3.14) also in the coordinate system , denoting the 
metric tensor in this system by gi>j'] from the invariance of ds we have 

ds^ = gi'j'dx'-'dx^' = gijdx^dx^ = gijA\,A^ j,dx''' dx^' , (3.16) 

where we have used Eqs. (3.5) and ( |3.7[ ) in the second line. Equation (3.16) reveals 
how the metric tensor changes under a coordinate transformation: 



9i'j' 



(3.17) 



Writing the metric tensors gi'j' and gij as matrices G' and G we can display the 
transformation procedure in the matrix form 



G' = A' GA 



(3.18) 



where A denotes the transformation matrix A*-, defined in Eq. (3.7), whereas A*^ is 
the inverse matrix A~^. 

Consider a transformation from a Cartesian coordinate system, in which g^j = Sij, 
to another Cartesian system, by means of a rotation. As the new coordinates are 
Cartesian, the transformed metric must also be Euclidean and Eq. (3.17) shows that 
this is the case: 



9: 



6i'j' (Cartesian — > Cartesian) 



(3.19) 



where the second equality follows from the fact that rotations are performed by 
orthogonal matrices with A^ = A~^. Rotations thus preserve the Euclidean metric. 

The metric tensor not only characterizes the measure of length in arbitrary coor- 
dinates, it turns out to describe the volume element as well. To see this, we represent 
the Cartesian volume element, dV^ = dxdydz, in arbitrary coordinates according to 
the standard rule 



dV = IdetAI dV. 



From the matrix representation (3.18) follows 

g' = (detAYg 



(3.20) 
(3.21) 
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where g and g' denote the determinants of the metric tensors. Note that g' is always 
positive, because the determinant g of the Euclidian metric is unity. Consequently, 
we obtain the volume element 

<W=y^'<W. (3.22) 

Dropping the primes, we note that ^/gdV always describes the volume element, in 
Cartesian or curved coordinates. 

Returning to our example of spherical coordinates, we can use the transformation 



procedure (3.17) to compute the metric tensor and the volume element: the metric 



tensor in Cartesian coordinates is gij = Sij and the required transformation matrix 



is expressed in Eq. (3.8); so we obtain 



1 
r2 
sin^ 9 

gi'j'dx^'dx^' = dr^ + r'^{d9^ + sin^ 9 ( 



(3.23) 



(3.24) 



The volume element (3.22) is given by the square root of the determinant of the ma- 
trix (3.23 ); we arrive at the familiar spherical volume element dV^ = sin 9 dr d9 d0. 



3.3 Vectors and bases 



The transformation relations (3.5)-(3.7) determine the transformation properties of 



vectors, and of more general objects. The coordinate displacements dx* are the 
components of a vector in space, therefore the components of a general vector V 
will transform in the same way under a change of coordinates: 



(3.25) 



The components and in (3.25) refer to an expansion of V in terms of the 



basis vectors Cj and Cj/ associated with each coordinate system: 

V = V'ei = V''ei, 



(3.26) 



The second line in Eq. (3.26) was obtained by use of Eq. (3.25) and comparison with 



the first line gives the transformation of the basis vectors: 

e,;/ = A\.,e,;. 



(3.27) 



In Cartesian coordinates = {x, z} the basis vectors are the familiar unit vectors 
in the x-, y- and z directions: 



(3.28) 



From (3.27) and (3.8) we then obtain the basis vectors in spherical polar coordinates 
as Cj/ = {Br, eg, e^} with 

sin 9 cos (f)i + sin 9 sin j + cos 9 k, 
r cos 9 cos (pi + r cos 9 sin j — r sin 9 k, 
-r sin^sin0i + r sin^cos0j. (3.29) 
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3.4 One-forms and general tensors 



The expression (|3.14|) is the squared length of the vector dx*, and the metric tensor 

(3.30) 



similarly gives the squared length of a general vector: 



\V\' = V-V = gijV'V^ = QiiyV . 



In Eq. (3.30) we have used the fact that the length of a vector is an invariant 



quantity, and the equality of the expressions evaluated in the coordinate systems 
and can be explicitly shown from the transformation rules (3.17), (3.25) to- 



gether with the inverse relations (3.12). Note that the invariance of Eq. (3.30) under 



coordinate transformations works because the vector and metric Qij are trans- 
formed by matrices that are inverse to each other. This inverse relationship of their 
transformations is to be associated with the fact that is an upper-index object 



whereas gij is a lower-index object. Each summation in (3.30) over an upper and 
lower index, called a contraction, is a coordinate-invariant operation because of this 
inverse property. It is useful to construct from and Qij a lower-index quantity Vi 
that transforms in the manner of Qij, as follows: 



(3.31) 



The transformation rule for Vi is easily established from those of and Qij-. 

V, = g,,.V^' = A\,g^,V' = A\,V,, (3.32) 
so that Vi does indeed transform similarly to Qij. The quantities Vi are the com- 



ponents of a covariant vector or one-form. We can view Eq. (3.31) as lowering the 



index on the vector V^ using the metric tensor, producing the associated one-form 
Vi- Defining the inverse metric tensor g^^ by 



we obtain from relation (3.31 ) 



r = g'^V,. 



(3.33) 



(3.34) 



Equation (3.34) can be regarded as raising the index on the one-form Vi using the 



inverse metric tensor, producing the associated vector V\ Note that the vector V^ 
and one-form Vi are the same only in Cartesian coordinates where gij = 6ij. The 
expression (3.30) for the squared length of a vector is more compactly written using 

V - V = ViV^. The scalar product of two vectors V and U 



the one-form V^: |V| 
is 



V U = gijV'U' = VU' = V'Ui = V'U' = V' Ui>, (3.35) 

where the coordinate invariance of the scalar product follow from the transformation 
properties of vectors and one-forms. From the position of the indices on the inverse 
metric tensor g'^^ we expect it to transform in a vector-like fashion. We verify this by 



writing the vector and one-form in Eq. (3.34) in terms of their transformed values, 
Ai.yi' = g'^A^'.V' from which follows V'' = A'^g'^ A^Wj> = g''^'V' so that 



(3.36) 
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The transformation rule for a tensor with an arbitrary collection of indices is 
now clear; we give as an example a four-index tensor: 



R 



j'k'l' 



jkl- 



(3.37) 



One can also raise and lower the indices of a general tensor in complete analogy 



to Eqs. (3.34) and (3.31). In this manner Eq. (3.33) may be regarded as a raising 
or lowering operation so that the Kronecker delta 5*^ is the metric tensor with one 
index raised, or the inverse metric tensor with one index lowered. 

It is straightforward to introduce bases for general tensors, but we will not pursue 
this here. The reader should, however, be able to deduce the index positions and 
transformation properties of any tensor basis, starting with the one-form basis. 



3.5 Coordinate and non-coordinate bases 



The scalar product of two basis vectors is seen from the definition (3.35) to be 



9ij) 



(3.38) 



since the components {CiY of a basis vector are 6-'^. The basis vectors (3.28) in 
Cartesian coordinates constitute an orthonormal basis. For the spherical polar basis 



(3.29) we can compute the dot products using the right-hand sides of Eqs. (3.29) 



or, much more simply, by using the scalar product (3.38) and the metric (3.23) in 
spherical polar coordinates. We see that the basis vectors are orthogonal to each 
other, but they are not all unit vectors: 



eJ 



' sin^ 6. 



(3.39) 



One can of course easily construct an orthonormal basis Cj by rescaling eg and 



1 



1 



60, 



(3.40) 



The reader unfamiliar with the material of this section will only have encountered 



spherical polar coordinates in combination with the orthonormal basis (3.40). How 



did we end up with the non-orthonormal basis? The answer is that we let the co- 
ordinates induce our basis through their differentiable structure. Recall that the 
components of a vector were introduced by analogy with the coordinate differentials 
dx*. The discussion of vector components then immediately specifies the basis as in 



Eqs. (3.25)-(3.27). Such a basis, induced naturally by the coordinates, is called a 



coordinate basis. The fact that the differentiable properties of the coordinates com- 



pletely determine the coordinate basis is the reason why the coordinate bases (3.27) 
behave exactly like the partial derivative operators in ( 3.6| ) The orthonormal basis 
(3.40 ), by contrast, is not induced in a similar manner by any coordinate system — it 



is a non- coordinate basis (see Ref. [125] for more detail). The only coordinate system 
that induces an orthonormal coordinate basis is the Cartesian system. It might be 



^This is not just a pleasant correspondence; in modern differential geometry the partial deriva- 
tive operators are the coordinate basis vectors. 
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suspected that it is always simpler to work in an ortho normal basis; in fact, for most 
purposes a coordinate basis is much simpler, in particular for the manipulations in 
curvilinear coordinates performed in electromagnetism textbooks. The reason these 
texts use the more complicated non-coordinate bases is that to exploit the simplicity 
of coordinate bases requires a little knowledge of tensor analysis. 



3.6 Vector products and Levi-Civita tensor 

For describing electromagnetism in arbitrary coordinates, we need to define the 
notion of vector products in three-dimensional space. Vector products V x U are 
antisymmetric, V x U = —U x V, and the vector products of the Cartesian basis 
vectors are cyclic: ixj = k, jxk = i and k x i = j . For implementing these 
properties, we introduce the permutation symbol [ijk] defined by 

{+1 if ijk is an even permutation of 123, 
— 1 if ijk is an odd permutation of 123, (3-41) 
otherwise. 

We define the Levi-Civita tensor e^^^ as the tensor whose components in some partic- 
ular right-handed Cartesian coordinate system are given by the permutation symbol: 

= [ijk], (Right-handed Cartesian coordinates). (3.42) 

We can find the components of the Levi-Civita tensor in any other coordinate system. 



Cartesian or otherwise, by transforming the expression (3.42) according to the rule 



(3.37); we constructed a tensor by definition. Specifically, the Levi-Civita tensor in 



an arbitrary coordinate system is 



= A^\A^'^A^'^[^Jk] = det{A',)[^' fk'] = ^ (3.43) 

using the Leibniz formula for the determinant of A''^ |131j . As in Sec. 3.2, det A is 
the determinant of the transformation matrix A';,, the matrix inverse of A''^. We 



obtain from Eq. (3.21) 



detA = ±v^. (3.44) 

Which sign should we take here? Clearly the sign in question is the sign of det A. 
If det A is negative the transformation changes the handedness of the coordinate 
system, so the new system is left-handed [39]. For example, a transformation that 
changes the sign of one or all three of the coordinates in the right-handed Carte- 
sian system has det A = —1 and results in a left-handed Cartesian system. A 
general transformation between Cartesian coordinate systems consists of a rotation 
and a translation, together with possible refiections of the coordinates. For any 



such transformation the Euclidean metric is preserved so Eq. (3.19) holds and the 



transformation matrix is orthogonal; it then follows from Eq. (3.44) that 



detA = ±l (Cartesian — > Cartesian), (3.45) 
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where the sign is negative if the transformation includes a handedness change. Using 



the relationship (3.44) we can now write the Levi-Civita tensor in arbitrary coordi- 



nates (3.43) in terms of the metric; we drop the prime on the indices of the arbitrary 



coordinate system and obtain 



(3.46) 



where it is understood that the plus (minus) sign obtains if the system is right- 
handed (left-handed). It is now clear that Eq. (3.42) holds in all right-handed 



Cartesian coordinate systems, not just in the one we started with. 

Some or all of the indices of e'^^^ can be lowered according to the prescription 



(3.31); if all are lowered we obtain another simple expression to complement the 



representation (3.46) 



(^ijk QilQjmQkn^ 



Imn 



±^giigjmgkn[lmn] 
V 9 



±^g[ijk] = ±y^[ijk]. (3.47) 



The Levi-Civita tensor is completely antisymmetric; this means that when its compo- 
nents are taken with all indices in the upper or lower position they are antisymmetric 
under interchange of two adjacent indices, e.g. e^^'' = — e-'*'^. 

The Levi-Civita tensor is required to compute vector products in an arbitrary 
coordinate system: 



UxV = e'^%Vkei. 



(3.48) 



The reader can verify that (3.48) is the standard vector product in right-handed 



Cartesian coordinates; the definitions we have given show that it maintains the 
same form when transformed to an arbitrary coordinate system. The components 



of the vector product can be written in terms of Eqs. (3.46) or (3.47): 



([/ X Vy = e'^%Vk, {U X V)i = eijkU'V' 



(3.49) 



We can apply the Levi-Civita tensor for expressing the well-known double vector 
product in arbitrary coordinates. 



Ax{BxC)= B{A ■ C) - C{A ■ B). 



(3.50) 



We obtain from formulas (3.46) and (3.47) and the defining properties (3.41) of the 
permutation symbol: 



e ^klr. 



(3.51) 



Contracted with the components of three vectors Aj, and C™, this identity 
generates the double vector product (3.50). Curls are also computed using the 



Levi-Civita tensor, but they contain a differentiation and we must learn how to 
differentiate in arbitrary coordinates. 
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3.7 The covariant derivative of a vector 



A scalar field in space is a function of the coordinates and we can take its partial 
derivative with respect to x*; in writing the partial derivative we can introduce 
another ink-saving device, as follows: 



d_ 



(3.52) 



Thus a comma means partial differentiation, with the following index giving the 
coordinate with respect to which the derivative is taken. In Cartesian coordinates 



the derivatives (3.52) are of course the components of the gradient vector V?/'. 



It is easy to see, however, from Eqs. (3.6), (3.7) and (3.32), that the expression 



(3.52) transforms as a one-form. Consistent with the index being in the lower 



position, the derivatives (3.52) are in fact the components of the one-form associated 



with the gradient vector, and this distinction can only be ignored in Cartesian 
coordinates where the vector and the one-form have the same components. In a 



general coordinate system we must raise the index in Eq. (3.52) using the inverse 
metric tensor to obtain the components of the gradient vector, i.e. 



(3.53) 



is a vector and so it is a coordinate-independent object; the reader may verify 
that g'^^ipjBi is the same in every coordinate system using the transformation rules 
for the quantities involved, g^^tpjBi = g'^ ^ ipj/ei'. Note that if we were to use an 
orthonormal frame the components of the gradient one-form and vector would not 



be given by Eq. (3.53); this is easily seen in the case of spherical polar coordinates. 



where if we replace the coordinate basis (3.29) in Eq. (3.53) by the orthonormal, 
non-coordinate basis (3.40), the components (3.53) are rescaled. Thus in a non- 



coordinate basis the partial derivatives are not the components of the gradient 
one-form and this makes such a basis unsuitable for our tensor calculus. Partial 
derivatives with respect to the coordinates only have a simple tensorial meaning if 
we use the basis induced by those coordinates. 

A vector field in space V = V^Ci consists of a sum of products of scalar fields 
and basis vector fields e^; to differentiate V we must of course use the Leibniz rule: 



dx^ dx^ ^ dx^ 



(3.54) 



This simple relation represents the most important fact about curvilinear coordi- 
nates. In Cartesian coordinates the basis vectors are constant and to differentiate 
a vector we need only to differentiate its components. The coordinate basis vectors 
for any other coordinate system, however, change in orientation and magnitude as 
one moves through space, see Fig. |6| The rate of change of a vector is itself a vector 
and so can be expanded in terms of the basis e^; we therefore have 



dx^ 



(3.55) 
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Figure 6: Basis vectors change with position. Polar coordinates in two dimensions (spher- 
ical polar coordinates with 9 = it/2) are depicted on the left. The basis vectors are ev- 
erywhere orthogonal but they rotate as one moves through the plane. While er remains 
a unit vector, the length of e^, equal to r, varies from zero to infinity. A general two- 
dimensional coordinate system is shown on the right. Here the orientations and lengths 
of both basis vectors vary, as does the angle between them. Since the dot product ei • 62 
does not vanish, the metric tensor is not diagonal in these coordinates (gi2 7^ 0). 



The 27 quantities T^^-^ are called the Christoffel symbols; T^^-^ is the kth component 
of the derivative of Sj with respect to x\ 

Let us pause to consider how T^ j change under a coordinate transformation. In 



fe' 



the transformed system the Christoffel symbols are defined by deji / dx^' = T 
and since we know how the basis vectors and the partial derivative operators trans- 



form, Eqs. (3.27) and (3.6), we can deduce the transformation law for the Christoffel 



symbols. We leave it as an exercise for the reader to show that 



-pk' 



A'=lA^.,A\.,r^, -A^;.,A^.,A' 



(3.56) 



Note that T'^^j do not obey the transformation law (3.37) of tensor components; they 



do not therefore constitute a tensor. 



The transformation rule (3.56) reveals an important property of the Christoffel 
symbols: they are symmetric in their lower indices, i.e. 



jk 



kj- 



(3.57) 



To prove this, first write the second term in the transformation rule (3.56) explic- 



itly in term of the partial derivatives (3.7) and show that it is symmetric in j' 
and i'. Now, the Christoffel symbols in any coordinate system can be obtained 
by transforming from Cartesian coordinates according to the rule (3.56), but in 
Cartesian coordinates the T^j^ vanish. This shows that in the new coordinate sys- 
tem T%, = T%, 
coordinates. 



but this coordinate system is arbitrary, so (3.57) holds for any 
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Using the Christoffel symbols we write down the derivative (3.54) of a vector. 
Insertion of Eq. (3.55) in Eq. (3.54) gives 



_d_ 



(3.58) 



where we have defined the quantities 



(3.59) 



which give the components of the derivative of V with respect to x*. Just as the 
derivative of a scala r (a zero-index tensor) gives a one-form (a one-index tensor), 
the derivative (3.58) of a vector V gives a two-index tensor V-'.^ called the covari- 



ant derivative of V . The semi-colon in the definition (3.59) thus means covariant 



differentiation of the vector, in which both the components and the basis are differ- 
entiated, whereas the comma means differentiation of the components. "Covariant 
derivative" therefore just means "correct derivative" ! 

Suppose we transport the vector from a point to its infinitesimally close 
neighbor + dx* without rotating it or changing its length; this is called parallel 
transport. At the new point the coordinate basis has changed, but the vector has not 
changed; so the vector components must vary as well as the coordinate basis. Here 
it is important that we differentiate correctly: the covariant derivative of the vector 
along dx* vanishes, but the ordinary derivative of its components will not, unless we 
are in Cartesian coordinates. Imagine and its parallel-transported neighbour as 
being part of a vector field. We use "DV^ to denote the increment of along dx* 
and introduce an alternative notation for the covariant derivative, as the differential 
operator 

^ V\^, = + T\yK (3.60) 
The increment of along dx* is then 



DV^ = (ViV^) dx\ (3.61) 
In the case of parallel transport the vector simply remains the same: 

BV^ = 0. (Parallel transport.) (3.62) 



A nontrivial example of parallel transport is given by Foucault's pendulum 
consider a pendulum that is attached to the Earth, but free to oscillate (Foucault 
suspended one from the dome of the Pantheon in Paris in 1851). The direction in 
which a pendulum swings follows the rotation of the Earth, it is parallel-transported 
on the surface of a sphere. The pendulum slowly turns due to the non-Euclidean 
curvature of the sphere, as we discuss in Sec. 3.11. 

The covariant derivative of a vector is the tensor V^'.,^, so it must transform 



according to the tensor prescription (3.37). Note that neither of the two terms on the 



right-hand side of Eq. (3.59) separa tely c onstitutes a tensor. From the transforma- 



tion rule of the Christoffel symbols (|3.56l) we see t hat the transformation of T-' .-V^ 



will not adhere to the tensor transformation (3.37), and the same is true of V\] but 
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when they are added together, however, the result does obey the transformation 



rule (3.37) of a tensor, as the reader should verify. 

In standard vector calculus one is confined to scalars and vectors, and so one 
does not encounter the two-index tensor V-'.^ in its full glory, but only some aspects 
of covariant differentiation. For example, the divergence V • V is constructed from 
the covariant derivative V-'.^ by contraction of its two indices. To see this note that 
in Cartesian coordinates the divergence is V^%, which (only) in these coordinates is 
equal to V\j^. Now, since V''.^ is a tensor, the contraction is a scalar, the same 



in all coordinate systems, just like the dot product (3.35). We therefore have 

V-V 



(3.63) 



Let us return to the example of spherical polar coordinates to see what a set of 
Christoffel symbols looks like. We can compute the Christoffel symbols from Eqs. 



(|3.55|) and (|3.29|), or by transforming from Cartesian coordinates, in which F*^-^ are 



zero, using the rules (3.56) and (3.8)-(3.11). Clearly, the recipe (3.55) presents the 
easier path and we find 



(f>r 





1 








r 


1 






F 


r 





r(f) 

cote 



r 

-r sin^ 6*, 



r% = cot 6, 
— sin 9 cos 6*, 



(3.64) 



all the other Christoffel symbols vanishing. We can now compute the divergence 



(3.63) in spherical polar coordinates, applying the covariant derivative (3.59): 

d 



V ■ V 



o + + ^^'^ + ^ + cot e V' 

or od ocp r 



1^ 

J.2 



sin 6 86 



(3.65) 



Remember that in Eq. (3.65 ) we are using the coordinate basis (3.29 ); to find the ex- 



pression in the orthonormal, non-coordinate basis (3.40) requires an obvious rescal- 
ing of the vector components. 



3.8 Covariant derivatives of tensors and of the metric 



Our knowledge of how to differentiate scalars and vectors leads directly to the ex- 
pressions for the covariant derivatives of more general tensors. Consider the scalar 
product (3.35), written in terms of a one-form and a vector: UiV^. Since this is a 
scalar, the correct derivative is the ordinary partial derivative of a scalar field: 

{U,V'),j = Uy\^ + U,,,V\ (3.66) 

where we have employed the Leibniz rule. Let us rewrite this equation in terms of 
the covariant derivative (3.59) of the vector: 

mi. 



(3.67) 
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The left-hand side of Eq. (3.67) is a tensor, the gradient one-form of the scalar UiV^ 



so the right-hand side is also a tensor. Now everything here, except the quantity in 
brackets, has already been shown to be a tensor; therefore the quantity in brackets 
is also a tensor, it is the covariant derivative of the one- form t/,-: 



a, 



or, using the notion of the covariant derivative as a differential operator. 



(3.68) 



(3.69) 



The covariant derivatives depends on the character of the object that is differenti- 



ated, vector components (3.59) are differentiated differently than the components of 



one-forms (3.68). The index position of the semi-colon in the definitions (3.59) and 



(3.68) indicates this better than the differential operators (3.60) and (3.69). Note 



that the covariant derivative obeys the Leibniz rule: 



(3.70) 



The fact that Ui-j is a tensor can be also shown directly by proving it transforms 



according the the tensor rule (3.37), using the known transformation properties of 



the objects on the right-hand side of (3.68) 



One can deduce the expression for the covariant derivative of any tensor by 
constructing a scalar from it with vectors and one-forms and applying the above 
procedure. For example, expanding the derivative of Aj^UiV^ one finds the covariant 
derivative of a mixed tensor. 



A * 



■pm /I I _|_ -p* A 

jk m ~ mk j 



(3.71) 



The general rule is simple: high indices get a positive sign in front of the Christof- 
fel symbols and low indices a negative sign. An important case is the covariant 
derivative of the metric tensor itself: 



ikdlj jk9u 



9ij;k 9ij,k 

9'.. = 9\k + ^\k9'' + ^'ik9''- 



(3.72) 
(3.73) 



A highly significant property of the metric tensor now emerges if we consider the 
fact that, as a tensor, it transforms as 



9i'j';k' — j|^^k>9ij■,k■ 



{'i.^A) 



It is clear from Eq. (3.72) that the covariant derivative of the metric vanishes in 
Cartesian coordinates, where gij = 6ij and the Christoffel symbols are all zero. 



But Eq. (3.74) shows that if gij-k is zero in one coordinate system it is zero in all 
coordinate system^ so we have 

9ij;k = 0. (3.75) 



■^This is a general property of tensors as a consequence of their transformation rule (3.37): if a 
tensor vanishes in one coordinate system it vanishes in all of them. 



23 



Similar reasoning starting from Eq. (3.73) shows that 



0. 



(3.76) 



It is instructive to verify the property (3.75) exphcitly for spherical polar coordinates 



using the expressions (3.72) and (3.23) and the Christoffel symbols (3.64). 

It follows from Eqs. (|3.75|) and (|3.76|) that Vi,j = (c/ifc1/^)y = gik-jV" + gikV% 
g,uV\^ and 



{g^^yk);j = g^^-jVk + g^^Vi-j = g^'^Vi-j, which are further examples 



of the general index lowering and raising operations (3.31) and (3.34). Since Vi-j is 
a tensor, we can in fact raise either of its indices, so we have 



g'%k. 



(3.77) 



Equation (3.77) defines what it means to have a covariant-derivative index in the 



upper position. 



It is very important that Eqs. (3.75) and (3.72) serve to determine the Christoffel 



symbols in terms of the metric tensor. To see this, we insert the expression (3.75) 



in Eq. (3.72) and write it three times, with different permutations of the indices: 



9ij,k 
9ik,j 
~gjk,i 



^\jgik + ^\jgiu 
jigik - ^\igjh 



(3.78) 



In view of the symmetry (3.15) of the metric tensor and the symmetry (3.57) of the 
Christoffel symbols, the sum of the three lines gives 2giiT^ = gij^k + g-ikj — gjk,i- 
Employing the inverse metric tensor g^^ we finally find 

1 



jk 



-fi'* igij,k + gik,j — gjk,i)- 



(3.79) 



Equation (3.79) represents the most economic way of computing the Christoffel 



symbols in general. Again, it is highly instructive to take the spherical-polar metric 



(3.23) and re-calculate the Christoffel symbols (3.64) using the recipe (3.79) 



3.9 Divergence and curl 



We can utilize the expression (3.79) for the Christoffel symbols to find a very simple 
formula for the divergence of a vector in arbitrary coordinates. From the definitions 

(3.80) 



(3.63) and (3.59) the divergence is 

V ■ V = 



and inserting (3.79) gives 

V ■ V = + ^ g'^igi,,, - g,,,i)V^ + ^ g''gu,V= = V\, + ^ g^'gu.V^^ (3.81) 

as is seen by relabeling summation indices and employing the symmetry of the metric 
tensor gij and its inverse g'^K Then we use a general property of the determinant of 
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a matrix: the derivative of the determinant g with respect to the matrix element 
Qij gives g g^^ where g"^^ is the inverse matrix. (This property follows from Laplace's 
formula of expressing the determinant in terms of cof actors and Cramer's rule for 
the inverse matrix |131] .) Consequently, 



1 dg 



9ii, 



9 



2 ' 2g dgu 

and so we arrive at the simplified formula for the divergence 



(3.82) 



V ■ V 



1 

71 



(3.83) 



The advantage of this formula compared to the definition (3.80) is clear, because it 



only contains an ordinary partial derivative. It is easy to see from the metric (3.23) 



that in spherical polar coordinates the divergence formula (3.83) gives the previous 



result (3.65). 



Another derivative operation familiar from Maxwell's equations is the curl of 
a vector. Like the vector product ( 3.48 )-( 3.49), the curl V x V is formed using 



the Levi-Civita tensor. Since the partial derivatives of Cartesian coordinates are 
covariant derivatives in general coordinates, we have 



(V X V) 



k;j- 



(3.84) 



A simplification occurs in formula ( 3.84[ ), however: from the covariant derivative 
(3.68) one finds that the terms containing the Christoffel symbols cancel. We can 



therefore, in the case of the curl, use partial derivatives: 

(V X Vy = e*^V,,„ (V X Vy = e/'Vk,j. (3.85) 
In this way we have found convenient expressions for the mathematical ingredients 



of Maxwell's equations, the divergence (3.83) and the curl (3.85). 



3.10 The Laplacian 



We can use formula (3.83) to find the divergence of the gradient vector (V^/^)* of 
a scalar ip, which will give us the general expression for the Laplacian of a scalar. 
Note that we must use the gradient vector (V?/^)*, rather than the gradient one-form 

(vV')i 

get 

V^^ = V ■ (V^) = (V^)\, = (g'^t/j,)., = ^ {V9 9''^j) , ■ (3.86) 



since the divergence is defined for vectors. From Eqs. (3.53) and (3.83) we 



1 



For spherical polar coordinates we apply Eq. (3.23) and easily obtain the well-known 
result 



1^ 
dr 



+ 



1 



d 



dr J sin 9 89 



sm.9 



dip 
89 



+ 



1 



5V 



sin^ 9 



(3.87) 
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Any reader who has had the misfortune of having to work out this expression without 



using formula (3.86) from differential geometry should now appreciate the power of 
the machinery we have developed. As a further salutary example, consider the 
monochromatic wave equation for the electric field 



0. 



(3. 



This is the equation as it is usually written in the textbooks, but the notation in the 



first term is treacherous for the student. When the wave equation (3.88 ) is considered 



in curvilinear coordinates, for example to find the radiation modes in a waveguide 
[53] , the student is apt to think that the components (V^-E)* of the vector V^£^ are 
the Laplacians V^(£'*) of the electric field components — this is not true. The 
partial derivatives of Cartesian coordinates get replaced by covariant derivatives in 
a general system, so the vector V^-E has in fact components g^^E'^ .-.y. = E^'-'.j, which 
are not the Laplacians {g^^E^ of E^. The three Laplacians [g^^E^ are not the 



components of a vector. The correct wave equation (3.88) is thus 



UJ 



E''\, + —E' = 0. 



(3.89) 



Expressed in curvilinear coordinates the wave equation (3.88) provides three coupled 



equations for the components E\ In practice, the textbooks ensure that they only 
deal with those components of the wave equation (3.88 ) for which (V^-E)* is equal to 
V^(i?*), such as the z-component in cylindrical coordinates. But this only confirms 
in the student's mind the false believe that (V^-E)* is the same as V^(£'*), and in 
the future he or she may come a cropper as a consequence. 



3.11 Geodesies and curvature 



We are now acquainted with the mathematics of arbitrary coordinates in three- 
dimensional Euclidean space. But there is an extra bonus for our efforts — we can 
also deal with curved space. A notion of curvature on a space is naturally induced 
by the metric tensor: if coordinates exist in which the metric has the Euclidean form 



(3.13 ) the space is flat, otherwise it is curved. Curvature of a three-dimensional space 



is difficult to visualize, but a familiar curved two-dimensional space is provided by 
the surface of a sphere. If the sphere has radius a, then from Eq. (3.24) the metric 
on the surface is 



(3.90) 



There is no transformation to coordinates {a;^,a;^} in which this metric takes the 
Euclidean form 

ds^ = (dx^y + (dx^. (3.91) 
Note that the crucial feature of the sphere that prevents its metric being transformed 



to the Euclidean (3.91) is not that it is a closed space with a finite area; we can 



consider any finite patch of the sphere, ignoring its global structure, and we would 
still be unable to find a coordinate transformation to Eq. (3.91). The crucial fact 
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that makes the sphere a curved space is that we cannot form a patch of the sphere 
from a flat piece of paper without stretching the paper. Consider, in contrast, the 
surface of a cyhnder. A cyhnder can be formed by rolhng up a flat piece of paper, 
so this space must be flat. The metric on a cylinder of radius a, in cylindrical polar 
coordinates with r = a, is 



dz^ + a" d02 = d^2 + (d(a0))^ 



(3.92) 



which is of the Eulidean form (3.91), proving that it is flat. It is not important that 
the coordinate (j) in (3.92) is periodic; curvature is a local property of a space, in 



contrast to its topology. A cylinder and a plane have different topologies but they 
have the same curvature, namely zero. 

How can we quantify the curvature of a space? We first need to generalize 
the notion of a straight line to the case of curved spaces like the sphere. The key 
property of a straight line joining two points in flat space is that it is the shortest 
path between those points. In a curved space we can still construct the shortest line 
between two points and this is called a geodesic. For a sphere, the geodesies are the 
great circles, i.e. the circles whose centres are the centre of the sphere. A geodesic 
is a curve x*(^) in space, with parameter ^. The length s of the curve between two 
points ^ = and ,^ = ^2 is the integral of the line element (3.14), 



£.2 



ds 



6 



gijdx'{i) dxi(0 




(3.93) 



For the curve a;*(^) to be a geodesic the length (3.93) must be a minimum, so the 



variation 5s must vanish when we perform a variation 5a;* (^) of the curve (main- 
taining the parameter values ^1 and ^2 at the endpoints). But this is completely 
equivalent to the principle of least action in mechanics [60j with "Lagrangian" 



L 



de ■ 



The geodesic is therefore given by the Euler-Lagrange equations: 







d dL 

d^ dx^ 



dL_ 

dx^ 



d 
de 



QijX'^ 



2L 



• I • j 



(3.94) 



(3.95) 



Equation (3.95) determines a geodesic curve once an initial tangent vector (direction 
of the geodesic) da;*(^)/d^ is specified. The parameter ^ is arbitrary, but we obtain 
a much simpler equation if we choose ^ to be the distance s along the geodesic, 
because in this case L = 1. We calculate 



d dL 

d^ dx^ 



d dx* 



9ij 



d^x* 
d^ 



+ 9ij, 



dx' dx-' 

'' ds ds 



(3.96) 



and insert this result in the Euler-Lagrangian equations (3.95) that we matrix- 
multiply with g"^^ from the left. Recalling the expression (3.79) for the Christoffel 
symbols we obtain the simple geodesic equation 

dV(s) , dx^(s)dx^'(s) ^ Q ^gg^^ 



ds2 



jk 



ds 



ds 
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In Euclidean space the geodesic equation (3.97) always gives the equation of a 
straight line, whether this line is expressed in Cartesian or curved coordinates, as the 
reader may verify for the case of the spherical polar coordinates using the Christoffel 



symbols (3.64). The reader may also show that the geodesies for the sphere, with 



metric (3.90), are the great circles. 



We find a simple interpretation for the geodesic equation (3.97) using the concept 
of parallel transport discussed in Sec. 3.7. Note that the vector dx^ds appearing 
in the geodesic equation is the tangent vector to the geodesic curve a;*(s). Equation 



(3.97) thus means that the covariant increment of the tangent vector along dx\s) 



is zero: the tangent vector is parallel-transported along the geodesic curve: 

BW = [VjU') dx'is) =0, U' = (3.98) 

Thus, a geodesic parallel-transports its tangent vector. In flat space, as one moves 
along a geodesic (straight line), the tangent vectors at all points are parallel. Equa- 



tion (3.98) generalizes this property for geodesies in curved space. Geodesies are 
lines of inertia. 

The behaviour of geodesies in a space can be used to quantify the amount of 
curvature. In flat space two geodesies (straight lines) either have a constant sepa- 
ration (parallel lines) or the separation distance changes at a constant rate, i.e. it 
changes linearly with distance along the lines. The second derivative of the separa- 
tion with respect to distance along the geodesies is therefore zero. In curved space 
by contrast, this second derivative does not vanish, a phenomenon called geodesic 
deviation; it is used to measure the curvature. As an example, consider again the 
surface of a sphere, depicted in Fig. [7} We see clearly that the separation of any two 
geodesies (great circles) does not change linearly with distance along the geodesies: 
for, the separation increases from zero to a maximum and then decreases to zero 
again. At the points where the tangent vectors to the geodesies are drawn in Fig. [7], 
the separation is a maximum and the tangent vectors are parallel as viewed in the 
three-dimensional space in which the sphere is embedded. The geodesies are parallel 
at these points, inasmuch as a notion of parallelism can be introduced on the sphere, 
but these "parallel lines" meet! Thus the postulates of Euclidean geometry do not 
hold in this space: it is curved. As another example, consider the geodesies on the 
surface of a cylinder. These are drawn by simply cutting open the cylinder into a 
fiat sheet, drawing straight lines on the sheet and rolling it up again to reform the 
cylinder. Clearly the geodesies behave just as in the plane: there is no geodesic 
deviation (second derivative of separation with respect to distance along geodesies 
is zero), showing again that a cylinder is a fiat space. 

How are we to compute the geodesic deviation? We need to consider a family 
of geodesies x*(s,5), where the parameter 6 labels (continuously) the geodesies in 
the family and for fixed 6 the parameter s is the distance along a geodesic. Note 
that 6 is not the distance between geodesies since two fixed values of 6 determine 
two geodesies that may be moving apart. The vector joining two infinitesimally 
separated geodesies with equal parameter value s is, however, given by d6 with 

= ^4^. (3.99) 
od 
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Figure 7: Two geodesies (great circles) on the sphere. The separation as a function 
of distance from a meeting point is not linear; as one moves along, the geodesies have 
an "acceleration" towards each other. Tangent vectors to the geodesies are drawn at 
the points where the separation has reached a maximum. These vectors are parallel in 
three-dimensional space. 



Now, what we are interested in is the rate of change of this joining vector as s 
increases, specifically its second covariant increment along the geodesic, D^V\ To 



compute the geodesic deviation we use the notation (3.60) of D and utilize the 
relationship 



ds ^ 86 



d'^x^ 



dx^ dx^ 



dx^ ^ dx^ 
85 ^ ds 



(3.100) 



Then we calculate D^y* 
ing the Leibniz rule of derivatives (3.70), 



ds 86 ds 86 

U^V j{U'''V kV'')ds'^ , regarding the Vj as operators obey- 



U^V (V, Vfc - VfeV,) W + V'^Vk [U^VjU') 
U'V" (V, Vfc - VfcV,) w 



(3.101) 



where we applied the geodesic equation (3.98) in the last step. If the commutator 



of the covariant derivatives vanishes the geodesic deviation is zero and the space 
is fiat. What is the meaning of this commutator? Recall the concept of parallel 
transport. Imagine we move a vector from the point x^ to the infinitesimally 
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Figure 8: Parallel transport around a closed loop on the sphere. A vector at the North 
Pole is parallel transported to the equator along the geodesic to which it is tangent. 
Since a geodesic parallel transports its tangent vector, the vector is still tangent to the 
geodesic at the equator. The vector is then parallel transported a quarter of the distance 
around the equator (also a geodesic). Parallel transport preserves the vector, as much 
as the curvature allows, so it stays at the same angle to the equator. Finally the vector 
is parallel transported back to the North Pole along a geodesic. On return to the North 
Pole the vector is rotated by 90 degrees with respect to the initial vector. This rotation 
is completely specified by the Riemann curvature tensor. Note that on a plane (zero 
Riemann tensor) this kind of operation does not rotate the vector. 



close neighbor + dx* and then again to the next neighbor by another increment 
dy*; finally we close a loop in moving the vector back by — dx* followed by — dy*: the 
vector would change as (VjVfc — Vk^j)A^dx^dy''. Any closed loop we can imagine 
as consisting of patches of infinitesimal loops; so, in a non-Euclidean geometry, a 
vector transported along a closed loop does not return to itself, see Fig.|8| Foucault's 
pendulum [SU], transported with the rotating Earth on a sphere, does not return to 
its original oscillation direction after one loop (24 hours). 

To calculate the commutator (VjVfc — VfeVj)A* we use the covariant derivative 
(3.71) of the mixed tensors V^^* and VjA* first and apply then the formula (3.60) 
for the covariant derivative of a vector. We get, utilizing the symmetry (3.57) of the 
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Christoffel symbols, 



= + ^^ij {A\k + r'fc^A™') 

~ (■'^^fcj ~ ^'^ i3,k + — r^^^jF"*;) AK (3.102) 

The quantity in brackets on the right-hand side of this formula depends only on 
the geometry of the space (the metric) and determines the amount of curvature. It 
transforms as a tensor with four indices (3.37), because the left-hand side of Eq. 
(3.102) is a tensor with three indices and the right-hand side contains a contraction 
over a vector. The space is flat if and only if this quantity is zero, giving zero 
geodesic deviation. It is one of the most important objects in geometry, the Riemann 
curvature tenso^ 

TDl — I P™ P* P™- (Q 1 C\'i,\ 

^ jkl — ^ jl,k ^ jk,l ^ mk^ jl ml^ jk' {O.LVO) 

The Riemann tensor describes both the geodesic deviation 

DV^ = R.^y^dx^ dx*= (3.104) 

and the result of loops in parallel transport 

(V, Vfe - V,V,) A' = R\^,AK (3.105) 

In Euclidean space, no matter how complicated the coordinate system we use, the 
Riemann tensor will always turn out to be zero (the reader may check this for spher- 
ical polar coordinates). On the other hand, for a curved space like the sphere, the 
Riemann tensor will not vanish in any coordinate system. In arbitrary coordinates 
= {x^, x^} a sphere of radius a has a Riemann tensor 

KcD = ^(^^c9bd - 5%gBc), (3.106) 



as can be checked for the case of the coordinates (3.90). Incidentally, besides the 
sphere, all readers are familiar with the physical effect of another curved space: 
the space-time geometry in which they reside, the geometry of gravity [HIl 197] . 
Here space-time is curved, not only three-dimensional space. This space-time has 
a Riemann tensor whose largest components at the surface of the Earth are of the 
order of 10~^^m^^. This space-time curvature is the reason the reader does not float 
off into space. 



^The tensor character (3.371 of R'' j^i can be verified using the transformation properties of the 
Christoffel symbols. 
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4 Maxwell's equations 



After having discussed the mathematical machinery of differential geometry we are 
now well-prepared to formulate the foundations of electromagnetism, Maxwell's 
equations ^3j|. In empty space, the Maxwell equations for the electric field strength 
E and the magnetic induction B are 

V-E = —, V-B = 0, 

„ ^ OB ^ ^ 1 dE . ^ 

We use SI units with electric permittivity Eq, magnetic permeability /xq and speed 
of light c in vacuum, Sq^q = c~^. Charge and current densities are denoted by p 
and j. In the following we express Maxwell's equations in arbitrary coordinates 
and arbitrary geometries. We show how a geometry appears as a medium and 
how a medium appears as a geometry. We develop the concept of transformation 
optics where we use the freedom of coordinates to describe transformation media 
as elegantly as possible. Furthermore, we generalize transformation optics to space- 
time geometries. We also return to our starting point, Fermat's principle. 



4.1 Geometries and media 



Maxwell's equations (4.1 ) contain curls and divergences. Using the expressions (3.83 ) 



and (3.85) from differential geometry we can now write these in arbitrary coordi- 
nates: 



In 



1 dE^ 



dt 



(4.2) 



This form of Maxwell's equation is also valid in arbitrary geometries, i.e. in curved 
space, for the following reason: any geometry, no matter how curved, is locally 
flat — at each spatial point we can always construct an infinitesimal patch of a 
Cartesian coordinate system, although these local systems do not constitute a single 
global grid. For each locally flat piece we postulate Maxwell's equations (4.1), and 



in writing these equations in arbitrary coordinates we naturally express them in a 
global frame. 



Let us rewrite the form (4.2) of Maxwell's equations with all the vector indices in 



the lower position and the Levi-Civita tensor expressed in terms of the permutation 



symbol according to formula (3.46): 



[ijk]E, 



k,j 



dt 



[ijk]Bkj 



d(±./gg'^Bi) ^ ^ 1 d(±./gg'^EA , , 

^ vyy J) r„-.MD_ _ ^ vyy + (4.3) 



dt 
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In this form, Maxwell's equations in empty space, but in curved coordinates or 
curved geometries, resemble the macroscopic Maxwell equations in dielectric media 
153], 



V-D = p, V-B = 0, 

dB OD 

VxE = - — , VxH = — + fioj, (4.4) 



written in right-handed Cartesian coordinates: 



[ijk]Eu, = [ijk]HKj = f- (4-5) 



In fact, the empty-space equations (4.3) can be expressed exactly in the macroscopic 



form (4.5) if we replace Bi in the free-space equations by Hi/ij,Q, rescale the charge 



and current densities, and take the constitutive equations 

D' = eoe'^Ej, B' = fiol^'^Hj. (4.6) 
e'^ = = ±^ g'^ . (4.7) 

Consequently, the empty-space Maxwell equations in arbitrary coordinates and ge- 
ometries are equivalent to the macroscopic Maxwell equations in right-handed Carte- 
sian coordinates. Geometries appear as dielectric media. The electric permittivities 
e*-^ and magnetic permeabilities fi^^ are identical — these media are impedance- 
matched [53], and the e^^ and /i*-^ are matrices — the media are anisotropic. Spatial 
geometries appear as anisotropic impedance-matched media. 

The converse is also true: anisotropic impedance-matched media appear as ge- 



ometries. We easily derive this statement from the constitutive equations (4.7): 



calculate the determinant det e of e^^ . The result is det e = ±y/g, the factor in front 



of the metric in the constitutive equations (4.7). So we obtain from the constitutive 



equations of a geometry the metric tensor of a medium 

For general = /i*-' , this geometry is curved. But, if and only if the Riemann tensor 



(3.103) vanishes the spatial geometry is flat. In such a case there exists a coordinate 
transformation of physical space where Maxwell's equations are purely Cartesian, 
where space is fiat and empty. The electromagnetic fields in real, physical space are 
transformed fields — the results of coordinate transformations. Media that perform 
such a feat are called transformation media. 

4.2 Transformation media 

Transformation media implement coordinate transformations in Maxwell's equa- 



tions. Note carefully how this interpretation of Maxwell's equations (4.2) works: we 
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write the free-space equations in coordinates that are not right-handed Cartesian, 
but we then interpret these equations as being in a right-handed Cartesian system 
with an effective medium (4.7). This sounds a bit paradoxical, but the way to 



think of it is to imagine two different spaces as well as two different coordinate sys- 
tems, see Fig. |9j In the first space, which we call electromagnetic space, we have no 
medium and we write the empty-space Maxwell equations in right-handed Cartesian 
coordinates. We then perform a transformation that gives us a non-trivial effective 



medium (4.7) and we interpret the transformed coordinates as being right-handed 
Cartesian in a new space, physical space, which contains the medium (4.7). The 



Cartesian grid in electromagnetic space will deform under the transformation and 
this deformed grid shows ray trajectories in physical space. 




Figure 9: Transformation media implement coordinate transformations. The left figure 
shows the Cartesian grid of electromagnetic space that is mapped to the curved grid of 
physical space shown in the right figure. The physical coordinates enclose a hole that is 
made invisible in electromagnetic space (where it shrinks to the point indicated there). 
Consequently, a medium that performs this transformation acts as an invisibility device. 
The case illustrated in the figure corresponds to the transformation r = Ri+r' {R2—R1) / R2 
in cylindrical coordinates where the prime refers to the radius in electromagnetic space. 
The region with radius Ri is invisible; R2 — Ri describes the thickness of the cloaking 
layer. 



There are two aspects of transformation media that make them highly significant. 
First, we know a good deal about solutions of Maxwell's equation in vacuum (light 
rays travel in straight lines, etc.) and to find the effect of the medium we can just 
take a vacuum solution in electromagnetic space and transform to physical space 
using the coordinate transformation that defines the medium: the transformed fields 
are a solution of the macroscopic Maxwell equations in physical space. Second, since 
a transformation medium is defined by a coordinate transformation, we can use this 
as a design tool to find materials with remarkable electromagnetic properties. 

Some readers may have nagging doubts about the juxtaposition of the mathe- 
matical tools of general relativity with the attribution of a physical significance to 
coordinate transformations. For a relativist coordinate systems have no physical 
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meaning; the geometry of the space is the important thing, and that is independent 
of the coordinate grid one chooses to cover the space. But here we wish to consider 
materials that, as far as electromagnetism is concerned, perform active coordinate 
transformations. In this theory, the coordinate transformation is physically sig- 
nificant, it describes completely the macroscopic electromagnetic properties of the 
material, and differential geometry is just as useful for these purposes as it is in 
general relativity. 

In our description of transformation media, the starting point of the theory was 
a right-handed Cartesian system in electromagnetic space; any non-trivial transfor- 
mation from this system gives an effective medium in physical space. It is, however, 
often convenient to adjust coordinates to the particular situation under investigation, 
see Fig. 10 We should be allowed to use any coordinates we wish in electromagnetic 
space. In order to implement this freedom of coordinates, we generalize the theory. 
Suppose that we describe electromagnetic space by a curvilinear system such as 
cylindrical or spherical polar coordinates; any deformation of this system through 
a coordinate transformation is to be interpreted as a medium, but to describe the 
electromagnetism in the presence of this medium we employ the original curvilinear 
grid. 



Figure 10: Transformation media in cylindrical coordinates. The figure shows the 
transformation of Fig.[9]in cyhndrical coordinates. These are the coordinates best adapted 
to this case. 



Let be the curvilinear system in electromagnetic space and we denote its met- 
ric tensor by 'jij. Then in electromagnetic space the empty-space Maxwell equations 
Q are: 

where we have made the metric dependence of the Levi-Civita tensor explicit. Now 
we perform a coordinate transformation and, as before, we interpret the resulting 
equations as being macroscopic Maxwell equations written in the same (curvilinear) 
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system we started with, but in physical space. We cast the equations (4.3 ) in physical 
space as the macroscopic equations (4.4) in the curvilinear system with the metric 



[ijk]Ekj 



0, 



at 



[tjk\Hkj = + /^oV7 J 



(4.10) 



By the same reasoning as before, we can interpret the free-space equations (4.3) 
as macroscopic equations (4.10) written in the curvilinear system if we rescale the 
charge and current densities and take the constitutive equations (4.6) with 



77 



(4.11) 



If we wish to implement a certain coordinate transformation, formula (4.11) gives a 
simple and efficient recipe for calculating the required material properties in arbi- 
trary coordinates [73]. 



4.3 Wave equation and Fermat's principle 



Impedance-matched media establish geometries and geometries appear as impedance- 
matched media. In general these media are anisotropic, but we still expect that light 
rays follow a version of Fermat's principle (2.1) of the extremal optical path. The 



metric gij should set the measure of optical path length. So we anticipate that light 
rays follow a geodesic with respect to the geometry given by the material proper- 
ties (4.8). How do we deduce Fermat's principle from Maxwell's equations? First 
we write down the wave equation for monochromatic fields with frequency u, the 



refined version of the Helmholtz equation (2.6). Any electromagnetic field consists 



of a superposition of monochromatic fields, if g^j does not change in time, which we 
assume, and also that there are no external charges and currents in the region we 



consider. We obtain from Maxwell's equations (4.2): 



UJ 



E\ 



(4.12) 



Note that we expressed the derivatives as covariant derivatives (semi-colons instead 
of commas) in anticipation of simplifications to come. The covariant derivative of the 
Levi-Civita tensor euim is zero, because it is determined by the metric tensor which 



has vanishing covariant derivative [97]. Consequently, using the notation (3.60) for 
the covariant derivatives 



= e'^hunNkVE^ = {5\5l - 6^6^) V^VE^ = V.V'E^ - V^VjE\ (4.13) 



where we applied formula (3.51 ) for the double vector product. The first term in the 



right-hand side of Eq. (4.13) resembles the covariant derivative of the divergence. 
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VjE^, that would vanish since the charge density is zero. But covariant deriva- 
tives do not commute in generah their commutator is given by the Riemann tensor 



(3.105). So we obtain, lowering the index i 

2 



^ = (VfcV, - V,V,) E'' - W,E„ (4.14) 



and arrive at the wave equation \110\ llllj 



^2 



WjE, + Ri^g^'Ek + —E, = (4.15) 



where Rij denotes the Ricci tensor [6T1 [9 



R., = R%. (4.16) 

The non- Euclidean geometry established by the medium may scatter light, even in 
the case of impedance matching. Impedance matching [53] may significantly reduce 



scattering, but not completely, unless the Ricci tensor (4.16) vanishes. In this case 
the Riemann tensor vanishes as well in three-dimensional space (see exercise 1 in 
§92 of Ref. [61]). The Riemann tensor quantifies the measure of curvature, irre- 
spective of the coordinates. If the Riemann tensor vanishes the geometry is fiat — 
the apparent curvature is an illusion where curved coordinates disguise a straight 
system: the material is a transformation medium. So only transformation media 
cause absolutely no local scattering]^ Only they guide electromagnetic waves without 
disturbing them; they merely transform fields from electromagnetic to real space. 
However, some transformation media still cause scattering in situations where the 
topologies of the two spaces differ from each other. In §5 we discuss two exam- 
ples of topological scattering (though for space-time transformations), the optical 
Aharonov-Bohm effect and analogues of the event horizons. 

Let us return to Fermat's principle. In the regime of geometrical optics [H] the 
phase of electromagnetic waves advances more rapidly than the variations of the 
dielectric properties of the material. The medium determines the wavelength A, and 
so A varies with the variations of the material properties. For geometrical optics to 
be a valid approximation, the gradient of the wavelength should be small, |VA| -C 1 
[62] . We represent the electric-field components as 

E^ = £ie''^ (4.17) 

where Si is a slowly varying envelope and ip the rapidly advancing phase. The 
gradient of the phase V<y9 describes the wave vector k and the wavelength is given 



by A = 27r/|fc|. We substitute the ansatz (4.17) into the wave equation (4.15) and 
take only the dominant terms into account, terms that contain products of the first 
derivatives of the phase or u'^, ignoring all other terms, including the curvature 



^In one dimension, all impendance-matched non-moving dielectrics are transformation media, so 
impedance-matched waveguides are reflectionless [55] , but some non-impedance-matched materials 



(with soliton index-profiles) are reflectionless too [ITJ (55] . 



37 



contribution. The amplitude Si is common to all remaining terms, and so the wave 
equation reduces to the dispersion relation 



(4.18) 



The dispersion relation contains the speed of light in the medium, d . For example, 
for an isotropic medium with refractive index n = e = fi we have the inverse metric 
tensor g''^ = n~^l and c' = c/n. For anisotropic media, the eigenvalues of the matrix 
g'^^ characterize c' for light rays propagating in the directions of its eigenvectors; c'/c 
is the square root of the corresponding eigenvalue of g^^ . 

In order to obtain Fermat's principle, we take advantage of the connection be- 
tween optics and classical mechanics (the connection that inspired Schrodinger's 
quantum mechanics in analogy to wave optics). We read the dispersion relation 



(4.18) as the Hamilton- Jacobi equation of a fictitious point particle that draws the 

For this we need to identify the wave vector 



spatial trajectory of the light ray [60] 
ip^i as the canonical momentum, as the derivative of some Lagrangian L' with re- 
spect to the velocity x\ We might be tempted to employ the Lagrangian (3.94) of 
the geodesic equation, but it is wise to use 



L2 _ 1 
2 ~ 2 



In this case we obtain 



and the Hamiltonian 



dL' 



(4.19) 
(4.20) 

(4.21) 



dL' 1 1 

^' " ~ ^' " 2 " 2 

Since the metric does not depend on time the Hamiltonian is conserved [60]. The 
conserved quantity is the "energy" of the fictitious particle. Our choice (4.19) of the 



Lagrangian is consistent with the dispersion relation (4.18) if we put 



H' 



2^' 



(4.22) 



This proves that L' is a suitable Lagrangian for light rays. The resulting Euler- 



Lagrange equations give the geodesic equation (3.97). So light rays follow a geodesic 



of the metric gij, they take an extremal optical path given by the properties of the 
medium (4.8): light follows Fermat's principle. 



4.4 Space-time geometry 

So far we considered spatial geometries or coordinate transformations in space, but 
there are also important examples of transformation media that mix space and 
time [71]. In §5 we discuss two cases in detail, the optical Aharonov-Bohm effect 
of a vortex and optical analogues of the event horizon. Here we write down the 
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foundations for the theory of space-time transformation media, Maxwell's equations 
in a space-time geometry. 

First, let us introduce space-time coordinates. The role of the Cartesian coor- 
dinates {x, y, z} is played by the Galilean system {ct, x, y, z} where t denotes time. 
In §3 we developed differential geometry in three-dimensional space, but it is easy 
to extend the treatment to four dimensions — the indices just take one more value 
and the Levi-Civita tensor has one more index. In this manner one can treat arbi- 
trary coordinates in space-time. The appropriate distance in Galilean coordinates 
{ct, X, y, z} in fiat space-time is, however, not given by the four-dimensional Eu- 
clidean metric, but rather by the Minkowski metric [61j: 

ds^ = c^de - dx' - dy2 - d^2. (4.23) 

Here we use the Landau-Lifshitz convention for the metric where spatial distances 
are counted as negative contributions (one can also use the opposite metric — ds^ 



where space counts as positive). The metric (4.23) rather describes a measure of 
time. It is customary to denote the time coordinate by x^ and the three spatial 
coordinates by where the Latin indices run over {1, 2, 3}. The metric in a general 
space-time coordinate system (or in curved space-time) is 

ds^ =^„/3da;"da;'^, (4.24) 

where the Greek indices run over the four values {0, 1,2,3}. The component Qqq is 



usually positive throughout space-time, as in the Minkowski metric (4.23), and the 
determinant g is always negative. 

It turns out that the theory of transformation media and materials that mimic 
curvature also works in four-dimensional space-time. This is proved in the Appendix, 
which provides a more challenging example of tensor algebra. There we show that 
the free-space Maxwell equations in arbitrary right-handed space-time coordinates 
can be written as the macroscopic Maxwell equations in right-handed Cartesian 
coordinates {ct,x,y,z\ with Plebanski's constitutive equations [112i| 

D' = eoe'^Ej + ^[ijk]wjH'', Bi = fiofi'^ - ^[ijk]wjEk, (4.25) 

= ij^i = _ gij^ u;^ = — . (4.26) 
900 900 

Space-time geometries appear as media. 



The constitutive equations (4.26) turn out to reveal an important hidden prop- 



erty of electromagnetism: electromagnetic fields are conformally invariant in space- 



time. Suppose we compare two geometries, one with the metric (4.24) and one with 
a metric where we re-scale equally space and time at each point, but the scaling 
factor Q may vary over space-time, 

g^p ^ Qix") g^^p. (4.27) 

This is not a coordinate transformation in general: we compare two different ge- 
ometries. They measure space-time distances differently, but the angles between 
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world lines are the same. As a result of the conformal scaling (4.27), the inverse 
metric tensor (7"^ scales with and the determinant g with fi^. So e^^ , fi^^ and 
Wi do not change; but these are the only quantities that depend on the geome- 
try. Consequently, the electromagnetic field does not notice a conformal space-time 
transformation, electromagnetism is invariant if space and time are re-scaled equally. 
However, if we only altered the measure of space by a spatially dependent factor 
n, the new geometry would behave like a dielectric medium with refractive index 
profile n. Conformal invariance is the basis of Penrose diagrams |149j where the en- 
tire causal structure of infinitely extended space-time is condensed, by a conformal 
factor, into a finite map one can draw and discuss. In Sec. 5.4 we apply the con- 
formal invariance of electromagnetism to discuss the space-time geometry generated 
by moving media 



For transformation media, we can generalize the constitutive equations (4.25) 



(4.26) to allow for a handedness change in the spatial part of the space-time coordi- 



nate transformation, and also to allow for a curvilinear spatial coordinate system in 



electromagnetic space. Our previous result (4.11) shows how to incorporate these 



possibilities in the permittivity and permeability in the constitutive equations (4.26). 



In addition, we can express the constitutive equations (4.25 )-(4.26 ) in index-free form 
if we denote the permittivity and permeability matrices by e and /x, respectively, 
and understand eE, etc., as a matrix product. Our final constitutive relations are 
then [H] 

w w 

D = EoeE + — X H, B = fiofiH x E, (4.28) 

c c 

e^i = = g'', Wi = ^. (4.29) 

vO' fi'oo fioo 

In addition to the familiar impedance-matched electric permittivity e and mag- 
netic permeability /i, a transformation that mixes space and time mixes electric and 
magnetic fields. A space-time geometry appears as a magneto-electric medium, also 
called a bi-anisotropic medium |126t ll28J. The mixing of electric and magnetic fields 
is brought about by the bi-anisotropy vector w that has the physical dimension of a 
velocity. In Sec. 5.4 we show that w is closely related to the velocity of the medium 
(for slow media, w is proportional to the velocity). Moving media are naturally 
magneto-electric [63] — a moving dielectric responds to the electromagnetic field in 
its local frame, but this frame is moving and motion mixes electric and magnetic 
fields by Lorentz transformations EI]- Such phenomena have been observed 
before special relativity was discovered, for example in the Rontgen effect [HH 1113] 
in 1888 or, indirectly, in Fizeau's 1851 demonstration [36] of the Fresnel drag [37] . 
More recently, moving optical media have been shown to generate analogues of the 
event horizon [106] . 

5 Transformation media 

The concept of transformation media has been the key idea for the design of invisibil- 
ity devices [121 SH [Z21 1104] , an idea that was put into practice using electromagnetic 
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metamaterials |122] . Moreover, the idea that inspired the surge of interest in meta- 
materials in the first place, the perfect lens |1U1] , turned out to represent an example 
of transformation optics as well [71]. Furthermore, the optical Aharonov-Bohm ef- 
fect [261 SSI ESI |66] and optical analogues of the event horizon [681 1106] can be 
understood as cases of media that perform space-time transformations [71]. In this 
section we discuss these examples in some detail. 



5.1 Spatial transformation media 



Let us first focus on some general properties of spatial transformation media. These 
are media that perform purely spatial coordinate transformations of electromagnetic 
fields. We develop an economic form of the theory that allows quick calculations and 
rapid judgment on the physical properties of spatial transformation media. For this, 
we distinguish two different coordinate systems and three matrices of metric tensors: 
the coordinates of electromagnetic space with metric tensor gi/ji, which in physical 
coordinates appears as the tensor components Qij. The physical coordinates x* 
are not necessarily Cartesian, but characterized by the metric 7jj of physical space, 
and neither are the electromagnetic coordinates required to be Cartesian, because it 
is often wise in physics to adopt the coordinates to the particular physical problem 
under investigation. 

Transformation media are made of anisotropic materials, in general, that are 
characterized by e and /i tensors. In curvilinear coordinates, the components of the 



dielectric tensors are given by Eqs. (3.36) and (4.11) as 
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(5.1) 



where ± indicates the handedness of the physical coordinates with respect to the 
electromagnetic coordinates, -|- for right-handed and — for left-handed coordinate 
systems; g and 7 denote the determinants of gij and •jij. We obtain from the 
transformation (3.17) of the metric tensor 



^7 = det {g',Mee{A' 



det jg'ij) ^ 
det\A\>) det'(A^ 



g' 



(5.2) 



We employ a convenient matrix form for the dielectric tensors ([5.1 ) by defining the 
matrices 



G' = (g,,,) , F = (7,,) , A = (A^,) 



(5.3) 



The first index labels the rows and the second index the columns of the matrices. 



In terms of the matrix representation (5.3) we obtain for the dielectric tensors (5.1) 

VdetG' AG'-^A^ 



e = u= , — - — - — . (5.4) 
detA ^ ' 

The equality of the electric permitivity e and the magnetic permeability [x means 
that transformation media are impedance-matched [53] to the vacuum, but in prac- 
tice impedance-matching can be relaxed at the expense of introducing some slight 
additional scattering [Ml ITgifT^. 
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For an anisotropic medium, the eigenvalues of the dielectric tensors Ec and /ic 
in Cartesian coordinates describe the dielectric responses along the three axes of 
the medium at each point in space; the axes are given by the eigenvectors. For 
transformation media, the tensor Ec = is symm etric , because it is constructed from 
the symmetric inverse metric tensor ^ in Eq. (5.1). Consequently, the matrix Ec 



has three real eigenvalues with the orthogonal eigenvectors a, the three orthogonal 
axes of dielectric response. Here we show how to calculate the eigenvalues from the 



dielectric tensor (5.4) in curvilinear coordinates. Consider the mixed tensor eT 



(^*'7'i) • Suppose we transform this tensor from curvilinear to Cartesian coordinates 
by the transformation matrix Ac. According to the rules of tensor transformations 
conveniently indicated by the index positions in e'^j, the curvilinear eT turns into 
the Cartesian Ec = Ac^FA"^, and vice versa eT = A'^EcAc- Consequently, A^^a 
is an eigenvector of eF with eigenvalue the eigenvalues of eF are those of the 
dielectric tensor in Cartesian coordinates. Note that the eigenvectors of e F are not 
orthogonal in general (and e^j is not symmetric). Nevertheless, the eigenvalues of 
E F directly give the dielectric functions of the transformation medium. 

To illustrate why this procedure offers considerable economy in calculations, 
we discuss the example of a cylindrical transformation. Suppose that the medium 
transforms the radius r in cylindrical coordinates, but preserves the angle and the 
vertical coordinate z, 



r x' 



(r') , r' = y/x'^ + 



X 



ry 



z = z . 



(5.5) 



We calculate the transformation matrix A from {x,y,z} to {x',y',z'} and express 
the result as 



/T 
R cos^ (h -\ sin^ i 



A 



r 



cos sm ( 



R ] cos sm ( 





cos + -R sin 






1 / 



(5.6) 



with 



R 



dr 
dr' 



det A 



R. 



(5.7) 



We obtain the dielectric tensor in Cartesian coordinates 



r 



cos^ + 



,/2 



sm 



R — \ cos sm I 







i?2 



./2 



COS sm ( 



„/2 



COS (j) + R sin 



\ 

calculate the eigenvalues, and obtain after some algebra 







(5.8) 



R 



r ^ Rr' ^ rR 



(5.9) 



42 



Alternatively, we calculate e in the coordinate system that is best adapted to cylin- 
drical transformations: both in electromagnetic and in physical space we use cylin- 
drical coordinates. From the line element in cylindrical coordinates ds^ = dr^ + 
r^d^^-l-d^^ we read off the metric tensors F = diag (1, r^, 1) and G' = diag (1, r'^, 1). 
For transformations of the radius A = diag {R, 1, 1). Consequently, 



(5.10) 



eT = diag R 



r Rr' rR 



this elementary calculation reveals the dielectric properties of the cylindrical trans- 
formation medium. If the transformation opens a hole in physical space, anything 
inside this hole is decoupled from the electromagnetic field: it has become invisible. 



as Fig. 11 showsF 



(a) 



ib) 





-2 -1 x' 1 2 3 -2 -1 X 1 2 3 



Figure 11: Invisibility for waves. The device transforms waves emitted in physical 
space as if they propagate through empty space in transformed coordinates, (a) Trans- 
formed space. The figure shows the propagation of a spherical wave emitted at the point 
(—1.5,0,0). (b) Physical space. The invisibility device turns the wave of figure (a) into 



physical space by the coordinate transformation illustrated in Fig. 10, The cloaking layer 



with radii Ri = 0.5 and R2 = 1.0 deforms electromagnetic waves such that they propa- 
gate around the invisible region and leave without carrying any trace of the interior of the 
device. 



5.2 Perfect invisibility devices 

Transformation media that have holes in their coordinate grids in physical space act 
as perfect invisibility devices. Consider the following simple example in spherical 
coordinates |104j . Suppose the device transforms the radius r' in electromagnetic 
space, but does not affect the spherical angles, 

r = r(r), 9 = 9', = 0'. (5.11) 



^The corresponding figure in Ref. [77] is not correct. We thank T. Tyc for pointing this out. 
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We obtain from Eq. (3.23) the matrices F = diag(l, r^, sin^ 6') and G' = diag(l 



^'2 ^12 



sin^6'), and, as in the previous subsection, the transformation matrix of the 



devices is A = diag {R, 1, 1) with R = dr/dr'. Consequently, 



„/2 



eV = 



diag 



^ ^:;r2 



r.l2 



J2 



= diag(^,i,,i, 



(5.12) 



Figure 12 shows an example of a radial transformation when r(r') reaches a finite 
value R\ at the origin in electromagnetic space where r' = 0. This means that 
in physical space the entire spherical shell of radius Ri corresponds to a single 
point in electromagnetic space. Anything lurking inside that sphere has become 
excluded from the electromagnetic field: it has become invisible. Figure 12 also 
shows that beyond the radius R2 electromagnetic and physical coordinates agree — 
R2 describes the outer radius of the cloaking device, whereas Ri is the inner radius. 
If electromagnetic and physical coordinates agree beyond the outer layer of the cloak 
then outside of the device electromagnetic waves are indistinguishable from waves 
propagating through empty space. The device guides electromagnetic waves around 
the enclosed hidden object without causing disturbances. So the object has not only 
disappeared from sight, but the act of hiding has become undetectable; the scenery 
behind the cloaking device would show no sign of the object nor of the device. In 
short, this transformation makes a perfect cloaking device. This behavior has been 
verified using scattering theory |117j and the most appropriate boundary conditions 
for the inner layer of the cloak have been identified [i5] . 




Figure 12: Cloaking transformation. Radial transformation performed by a spherical 
cloaking device. The radius r' of spherical coordinates in electromagnetic space is trans- 
formed to the radius r in physical space, and vice versa. Ri denotes the inner and R2 the 
outer radius of the cloaking shell: physical regions with radius r < Ri are not reached 
by electromagnetic waves and for r > R2 the electromagnetic coordinates agree with the 
coordinates of physical space. 



Fortunately perhaps, perfect invisibility [104j is severely limited [71]. Figure 13 
shows the trajectories of light rays passing through the cloaking shell. The rays make 
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Figure 13: Spherical cloalting device. (From Ref. |104j . Reprinted with permission from 
AAAS.) The figure shows the trajectories of light rays. (A) two-dimensional cross section 
of rays striking the device, diverted within the annulus of cloaking material contained 
within i?i < r < i?2 to emerge on the far side undeviated from their original course. (B) 
three-dimensional view of the same process. 



a detour around the hidden core of the device, but they must arrive at the same 
time as if they were propagating through empty space. So there is only one option: 
inside the cloaking device the phase velocity of light must exceed c. This is possible 
in principle [l3l |85] , but only in regions of the spectrum with narrow bandwidth that 
correspond to resonances in the material. But, to make matters worse, light rays 
straddling the inner lining of the cloak must go around the inner core in precisely 
the same time it would take them to traverse a single point in electromagnetic space, 
in zero time. Light must propagate at infinite phase velocity! One can put these 
thoughts into precise mathematical terms by considering the product 616263 of 
the three dielectric functions of the transformation medium. This product is the 
product of the eigenvalues of eF, the determinant of eF. We obtain from formula 



element in electromagnetic space. The volume of a single point is zero; hence at 
least one of the eigenvalues of the dielectric tensor is zero - the speed of light 
reaches infinity at the inner lining of the cloak. This inevitable feature restricts the 
performance of perfect invisibility devices to a single frequency set by the particular 
metamaterial used. Perfect invisibility devices are impractical, but imperfect devices 
inspired by these ideas may be possible. 

5.3 Perfect lenses 

The perfect lens, made by negative refraction |10H 1143] , turns out to be an exam- 
ple of a transformation medium as well [13]. As in perfect invisibility devices, the 



(5.4) 




(5.13) 



We showed in Sec. 3.2 that V det G' describes the volume element, here the volume 
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unusual properties of the lens are based on an unusual topology of the transfor- 
mation. For achieving invisibility, electromagnetic space does not cover the entire 
physical space, whereas for perfect lenses electromagnetic space turns out to be 
multi-valued: single points in electromagnetic space are mapped to multiple points 
in physical space. 

Consider in Cartesian coordinates the transformation x{x') illustrated in Fig. 



14, whereas all other coordinates are not changed. In the fold of the function x{x') 
each point x' in electromagnetic space has three faithful images in physical space. 
Obviously, electromagnetic fields at one of those points are perfectly imaged onto 



the others: the device is a perfect lens, see Fig. [T5j Perfect lensing was first analyzed 
|1U1] as the imaging of evanescent waves in a slab of negatively-refracting material, 
waves that may carry images finer than the optical resolution limit [TT]. Various 
aspects of this idea |101] have been subject to a considerable theoretical debate 
[89| 1102^ 11411 1103[ 196] , but recent experiments [33l HU [82] confirmed sub-resolution 
imaging. Our pictorial argument leads to a simple intuitive explanation of why such 
lenses are indeed perfect [71] . 

We also immediately obtain from the theory of transformation media the reason 
why perfect lensing requires left-handed media with negative e and fi jHOl EHl 11201 
11271 1144] : inside the device, i.e. inside of the x' fold, the derivative of x{x') becomes 
negative and the coordinate system changes handedness. The electromagnetic left- 
handedness of negative-index materials appears in a left-handed coordinate system. 



We obtain from our recipe (5.4) the compact result 



^. / dx dx' dx'\ . ^ ^. 

^ = ^ = ^^"Hd^'d^'d^J ■ ^ ^ ^ 

At the place where the transition between right-handed and left-handed coordinates 
occurs, the derivative of x{x') is undefined and so are e and /i, but this has no adverse 
effect in practice It simply describes the discontinuity at the boundary of the 
medium where our transformation x'{x) automatically gives the correct boundary 
conditions [53j, because Maxwell's equations are satisfied arbitrarily close to the 



boundary. Equation (5.14) shows that when dx'/dx is —1 inside the device and 
-|-1 outside, we obtain the standard perfect lens [lOlj 1143] based on an isotropic 
left-handed material with e = n = —1 inside. Furthermore, we see that lenses with 
e = fi = —1 are not the most general choice. One could use an anisotropic medium 
to magnify perfect images by embedding the source or the image in transformation 
media with |da;'/da;| ^ 1. 

Note that the perfect lens made by negative refraction jlOlj is not the only case 
of perfect imaging [11]. In 1854 Maxwell proposed a curious device called the fish- 
eye [HI]. In Maxwell's fish-eye light goes around in circles in such a way that any 
point in the device is imaged to a partner point with infinite precision. In turns 
out [75] that Maxwell's fish-eye also is an example of a transformation medium, 
but one with non-Euclidean geometry: here physical space is not mapped to flat 
space, but to our favorite example of a curved space, the surface of a sphere. In the 
following we use this visualization of the fish-eye to explain how it works. Consider 
the stereographic projection from the surface of a sphere with radius vq to the x, y 
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Figure 14: Perfect lens. Negatively refracting perfect lenses employ transformation 
media. The top figure shows a suitable coordinate transformation from the physical x axis 
to the electromagnetic x' , the lower figure illustrates the corresponding device. The inverse 
transformation from x' to x is either triple or single-valued. The triple-valued segment 
on the physical x axis corresponds to the focal region of the lens: any source point has 
two images, one inside the lens and one on the other side. Since the device facilitates 
an exact coordinate transformation, the images are perfect with a resolution below the 
normal diffraction limit |11| : the lens is perfect |101j . In the device, the transformation 
changes right-handed into left-handed coordinates. Consequently, the medium employed 
here is left-handed, with negative refraction |144j . 



plane as shown in Fig. [16} In formulae 

y 



X 



x' y' 



1 - z'/tq 
and the inverse transformation 
2x , 2y 



1 - z'/tq 



X 



1 + {r/vQY ' 



y 



1 + {r/rQ)'^ ' 



with x'^ + y'"^ + z' 



ro 



r/roY + 1 



(5.15) 



a;2 + y2. (5.16) 



Imagine light propagating on the surface of the fictitious sphere with uniform re- 
fractive index uq. In this case, the line element of electromagnetic space is 



ds^ = nl (dx'2 + dy'2 + dz'2) 



(5.17) 
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Figure 15: Propagation of electromagnetic waves in a perfect lens. The lens facilitates the 
coordinate transformation shown in Fig. [T4| Spherical waves in electromagnetic space are 



transformed into physical space, (a) The wave of Fig. 11 (a) is emitted outside the imaging 
range of the lens. The wave is transformed by the lens, but the device is not sufficiently 
thick to form an image, (b) A wave is emitted inside the imaging range, creating two 
images of the emission point, one inside the device and one outside, corresponding to the 



image points of Fig. 14 



Similar to Eq. (2.3) we express the differentials dx', dy' and dz' in terms of the 



differentials in physical space, and obtain 



ds^ = {dx^ + dy') , 



n 



+ y^ + Tq 



(5.18) 



By the way, this result also proves that the stereographic projection is a conformal 
map of the surface of the sphere on the two-dimensional planej^ We can easily extend 
this procedure from two-dimensional surfaces to three-dimensional spaces by con- 
sidering the stereographic projection from the hyper-surface of the four-dimensional 
sphere to three-dimensional space or simply by rotating the x,y plane around one 
axis, say the y axis, while performing two-dimensional projections. Because of ro- 
tational symmetry, we obtain in 3D 



ds' = {dx' + dy' + dz') 



n 



2nor' 



x^ + y^ + z^ + 



(5.19) 



Let us return to 2D where we can draw pictures, as shown in Fig. [I6| On the surface 
of the sphere, light rays propagate along geodesies, the great circles. It is one of the 
remarkable properties of the stereographic projection [93] that circles on the sphere 
are transformed into circles on the plane. So, in physical space, light goes around 



^The stereographic projection, invented by Ptolemy, is an ingredient of the Mercator projec- 
tion used in cartography; on the complex plane the Mercator projection is the logarithm of the 
stereographic projection [93 . 
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in circles as well. The great circles originating from one source point on the sphere 
meet again at the antipodal point. In the stereographic projection, the image of the 
antipodal point is the reflection of the source on a circle, the circle with the radius 
tq of the sphere. Here the source is perfectly imaged: Maxwell's fish-eye makes 
a perfect lens [11]. However, this is quite an usual instrument: both the source 
and the image are embedded in the non-uniform refractive index profile (5.19) of 
the fish-eye. In contrast, for the fiat lens made by negative refraction |lUlj source 
and image are outside the device; the lens acts across some distance. The imaging 
range is the distance from the lens where multiple images are formed, where x{x') 
is multivalued. As figure 14 shows, the imaging range is equal to the thickness of 
the lens for the standard case where dx'/dx = —1 and, accordingly, e = /i = — 1. 
Because of losses in the material, the imaging range has been very small in practice 
though [331 HUES]. 




Figure 16: Maxwell's fish-eye. The left figure illustrates the stereographic projection 
described in Eq. (5.15). A line is drawn from the North Pole through a point on the 
sphere. The image of this point is the intersection of this line with the plane through the 
Equator. The top right figure shows light rays on the surface of the sphere and projected 
on to the Equatorial plane; the bottom right figure shows the rays in the plane where the 
dotted circle indicates the Equator. Light rays originating from one point meet again at 
the antipodal point: Maxwell's fish-eye makes a perfect lens. 



In another twist of the story, both the interior of a hollow sphere and a perfect 
flat lens are predicted to cause unusual quantum effects, they lead to repulsive 
Casimir forces [El [76] . The Casimir force is a force caused by, literally, "nothing" 
— by the zero-point energy of the quantum vacuum [SHI [Hl|- What is zero- 
point energy? Imagine the electromagnetic field as a superposition of infinitely 
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many stationary modes. Some of the mode coefficients may be zero, but according 
to quantum physics being zero is a positive statement: the modes are in vacuum 
states. Each mode oscillates like a harmonic oscillator with its eigenfrequency uj^. 
The vacuum state is the ground state of the harmonic oscillator, but this state 
carries the energy fvjjy/2 [62]. The sum of the ground-state energies of all the modes 
is called zero-point energy. So even in the absence of any electromagnetic fields, 
in the complete quantum vacuum state, the zero-point energy remains. Moreover, 
this energy is infinite for infinitely many modes. On the other hand, the mode 
frequencies depend on the boundary conditions. Consider, as Casimir did [22], 
quantum electromagnetism inside two parallel mirrors with distance a. Although the 
total zero-point energy is infinite, the difference U between the zero-point energies 
per area at a and at oo turns out to be finite and negative [221 EHl El] , 

"(a)^-^. (5.20) 

a mesmerizing combination of natural constants. Read U{a) as the potential for the 
force between the mirrors with distance a. Since the potential falls with decreasing 
distance the mirrors are attracted to each other. This attraction caused by the 
quantum vacuum has been observed [5H] and good quantitative agreement with 
theoretical predictions was found [23]. Normally, the Casimir force is weak, but it 
may become important for small distances a, distances on the scale of nanomachines 

Imagine that we place a perfect lens between the mirrors. The lens should be 
sufficiently thick such that the mirrors lie within the imaging range h of the lens. 
The lens maps physical space to electromagnetic space where the mirror distance is 
reduced to 

a' = 26 -a, a<2h. (5.21) 

Electromagnetic space is empty, apart from the mirrors; there they are attracted 
to each other with potential W(a'), but in physical space this attraction turns into 
repulsion, because when a' decreases, a must grow. We obtain for the force per area 

<m _ dUja') da' _ Hctt^ 
•'~~d^~ da' d^ ~ 240a'4 ' ^' ^ 

This is not the only example of repulsive Casimir forces, but so far it is the only one 
where repulsion is explained by a simple visual argument. The vacuum confined to 
the interior of a hollow conducting shell causes repulsion as well [H] ; the "nothing" 
would cause the shell to explode if the shell is sufficiently small such that the Casimir 
force is strong. Another example is a miniature sandwich of different dielectric 
media, silicon, ethanol and gold [21] . Also in this case the Casimir force is repulsive 
for reasons that are far less intuitive [30] ■ It turns out [76] that the perfect lens needs 
to have gain for sustaining a repulsive Casimir force, but, in principle, it represents 
a simple case where two mirrors repel each other across some distance, without any 
adjacent mediator (such as the ethanol in a layer of silicon, ethanol and gold). The 
Casimir repulsion could be sufficiently strong such that one of the mirrors would 
overcome gravity, levitating on zero-point energy. 
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5.4 Moving media 

Perfect invisibility devices and perfect lenses are made of spatial transformation 
media - metamaterials that facilitate transformations in space, but that do not alter 
the measure of time. Here we develop a physical interpretation of electromagnetism 
under space-time transformations. We show that they correspond to moving media 
IB7] (or media with moving dielectric properties |1U6] ). For this we translate 



Fermat's principle (2.1 ) into a space-time geometry for moving media with isotropic 
and equal e = = n. We describe the electrodynamics of moving media in an inertial 
frame in Cartesian coordinates, the laboratory frame, but as an intermediate step we 
take the liberty of performing transformations to other inertial frames of reference. 

Imagine the medium consists of infinitesimally small cells moving with velocities 
u. The velocity profile may vary in space and time, but for a given space-time 
point we can always construct a locally co-moving inertial frame. In such a frame, 
the medium is not moving, at least at the specific point we are considering, and 
so we can apply here the standard theory of electromagnetism in media at rest. In 



particular, Fermat's principle (2.1) suggests that the electromagnetic field perceives 
here a spatial geometry with the line element dP = n^(dx^ + d?/^ + dz^). We translate 
the spatial geometry into a space-time geometry with the space-time line element 
ds^ = c^dt^ — dl"^, and hence the metric tensor Qajj = diag (1, — ra^, — n^, — n^) with 
the determinant —n^ and the inverse matrix g"'^ = diag (1, — ra"^, — — The 



reader easily verifies from Eq. (4.26) that this geometry indeed describes a medium 
at rest with e = = n. 

The local geometry of the medium is still purely spatial; it influences the spatial 
part of the metric, but not the measure of time. We can change this using the 
conformal invariance of Maxwell's equations (in the absence of external charges and 
currents). We may multiply the space-time metric by an arbitrary factor, called 



conformal factor, and still obtain exactly the same constitutive equations (4.25) as 
before. Multiplying diag (1, —n^, —n"^, —v?) by we obtain the new metric tensor 
Qajs = diag (ra"^, — 1, — 1, — 1) with determinant — and inverse matrix g"'^ = 
diag (n^, —1, —1, —1). This geometry influences the measure of time (in the metric 
element g^o) but not the spatial part of the metric. We write the metric tensor as 

9a(3 = Vafs + in-^-l)u^Ufs, ^7°^ = //"^ + (n^ - 1) u° (5.23) 
r]^p = diag(l,-l,-l,-l)=r/"^ (5.24) 

and = (1,0,0,0) = Ua- Then we use a key insight from special relativity [6T] : 



r)af3 describes the metric tensor (4.23) of fiat space-time in an inertial frame (the 
Minkowski metric). Since all inertial frames are equivalent, rjais does not change in 
transformations from one inertial frame to another, for example in the transforma- 
tion from the laboratory frame to the locally co-moving frame and vice versa. Using 
this insight from relativity, we also develop a physical interpretation for u"' and Ua 
that allows us to translate the g"'^ and ga/3 back to the laboratory frame: we regard 
u" as the local four-velocity of the medium, 

= r^»0up = ^, (5.25) 
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Here ds is the line element (4.23) in fiat space-time, 



dx" dx^ = c^dt^ - dx^ - d?/2 - dz^ 



1 - 



u 



de 



(5.26) 



expressed in terms of the three-dimensional velocity vector of the medium 

dx 
'dt' 



u 



For zero velocity, i.e. in the locally co-moving frame, we get u"^ 



(5.27) 
(1,0,0,0). 

Hence we can indeed interpret the m° and Ua in the metric (5.23) as the local 
four-velocities. The important point is that transforms like the four-dimensional 
coordinate vector x", as the position of the index in u°' indicates, because ds is 
invariant. Consequently, the expressions (5.23) are not only true in locally co- 
moving frames, but in the laboratory frame as well. All we have to do is to calculate 
li" and Ua for non-zero velocities (the local velocities of the moving medium). We 
obtain from Eqs. (5.25), (5.26) and (5.27) the expressions 

[l,~u/c) 



u 



- m2/c2 



Ur 



- mVc2 



(5.2^ 



In this way we have established the space-time geometry of moving media discovered 
in 1923 by Walter Gordon (and independently rediscovered several times [651 11081 
I109j ). In matrix form, the metric tensors appear as 



/ c n — 

(? — V? 



c2 — V? 



CU 



n' - 1 



V C2 - 



CU 



-1 + 



n' -I 



U (^^ u 



det {gai3) = -n ^, 
/ c^n~^ — 1 — n~ 
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1 — n 
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2 



c2 — 



CU 



V C2- 



cu 



-1 



n 



- 1 



u 



u 



I 



(5.29) 
(5.30) 
(5.31) 



We obtain from Eq. (4.29) the dielectric tensors of moving media (that are locally 
isotropic and impedance matched), 

n 



1 — u^'n? /c^ 



1 



u 



1 + (1 



u® u 



nt 



(5.32) 



in the limit of low velocity, u/c <^ 1. Furthermore, since for moving media ga/i 
contains mixed space-time elements goi, we also obtain the vector 



n 



w 



u^'n? /c' 



u 



[n 



l)u. 



(5.33) 



In this way we have found a possible physical interpretation for the bi-anisotropy 
vector w of electromagnetism in space-time geometries: w may appear as the ve- 
locity of a moving medium. 
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5.5 Optical Aharonov-Bohm effect 

Perfect invisibility devices and perfect lenses exploit transformations to non-trivial 
topologies in space - excluded regions in physical space or folds in electromagnetic 
space. Here we study a simple example, the propagation of light through a vortex, 
that turns out to represent a transformation medium with multi-valued space-time 
geometry. Consider a vortex in a fluid, say in water flowing down a plug hole. A 
vortex concentrates the vorticity V x w of the swirling flow u in one line, the vortex 
core. In Cartesian coordinates the velocity profile of a straight vortex is given by 
the expression 

u = \ ^ \. (5.34) 



+ y 







because for this profile V x w vanishes, except at the z axis where u diverges, but 
the circulation § u ■ dr gives the finite value 27rW. All vorticity is concentrated 
along the vortex line. 



It might be a useful exercise for the reader to express the velocity profile (5.34) of 
the vortex in cylindrical coordinates (the most natural coordinates for this situation). 
According to Eqs. (3.27) and ( 3.7[ ) the coordinate-basis vectors of the cylindrical 



coordinates are 

= cos</)i + sin</)j, eq = —r sm.(j)i + r coscf) j , = k. (5.35) 

The metric tensor is the familiar = diag(l,r^, 1). One obtains for the velocity 
profile of the vortex 

u' = ^ \ 1 I , = (0,W,0). (5.36) 




We immediately see from the expressions (3.85) of the curl in arbitrary coordinates 



that V X w vanishes, because Ui is constant. The vorticity is concentrated in the z 
axis that is excluded from the cylindrical coordinates. 

Imagine the vortex is illuminated from the side. Any moving medium drags light, 
as Fresnel |3^ discovered in 1818 (deducing the correct result without knowing spe- 
cial relativity) and as Fizeau [36j observed for water in uniform motion in 1851 
(in interferometric measurements without lasers). The basics of Fresnel's drag are 
very simple: light propagating with the flow is advanced, whereas light propagating 
against the current lags behind. (The degree of dragging, Fresnel's dragging coeffi- 
cient, depends on special relativity though.) Dragged by the moving medium, light 
rays experience phase shifts and, if the phase fronts are deformed, light rays are de- 
flected. For flow speeds u much smaller than the speed of light in the medium, c/n, 
we expect that the phase shift is proportional to the integral of u along the prop- 
agation of light. For regions with vanishing vorticity, we can deform the contours 
of the phase integrals j u ■ dr and so the phase difference between light waves that 
have passed the vortex on different sides is proportional to the circulation ^ u-dr, a 
constant. Hence we expect that the water vortex does not deflect light, but imprints 
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a characteristic phase shp, in analogy to the Aharonov-Bohm effect |H 1105^ [l36] . In 
the Aharonov-Bohm effect charged matter waves, electrons for example, are not de- 
flected by a vortex in the magnetic vector potential, but experience a characteristic 
phase shift. The optical analogue of the Aharonov-Bohm effect was first described 
by Jon Hannay in his PhD thesis 09] and has also been independently rediscovered 
PU] . The effect could be used to detect quantum vortices with slow light [HS] and it 
is related to the Aharonov-Bohm effect with surface waves [HI I115[ I146j and to the 
the gravitational Aharonov-Bohm effect |130 j. 

Assuming that in the optical Aharonov-Bohm effect the phase modulation is pro- 
portional to the integral of j u-dr we guess a space-time coordinate transformation 
that should describe the wave propagation. Then we use our formalism to verify 
that this guess is correct. The integral of the velocity profile (5.34) gives 

-ydx + xdy 



u- dr = W 



W / d arctan 



(5.37) 



This phase should modulate the time evolution of the wave. The water has a uniform 
refractive index n that simply rescales the wavelength. Therefore we assume in 
physical space, using cylindrical coordinates. 



ct = ct' — a (f)' , 



r 



(5.3^ 



= 0, Z = - 

n n 
with a constant a to be determined later. In electromagnetic space we obtain 

ct' = ct + a(f), r' = nr, 0' = 0, z' = nz. (5.39) 



Electromagnetic space is flat and empty with the Minkowski line element (5.26) in 
primed coordinates expressed in physical coordinates, 

ds^ = {cdt + adcpy -n^{dr^ + Pd^'^) - dz^ . (5.40) 

So the metric tensors are 
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(5.41) 



(5.42) 



(5.43) 



We recall that in cylindrical coordinates the spatial metric is described by the ma- 
trix r = diag(l,r2,l) with determinant 7 = and obtain from the constitutive 
equations (4.29) the dielectric tensors 

£ = = ndiag (1, r"2, 1) (5.44) 
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and hence 

£T = fiT = nl; (5.45) 
the medium is isotropic with refractive index n. We also get from the constitutive 



equations (4.29) the bi-anisotropy vector in cyhndrical coordinates 

Wi = {0,a,0) (5.46) 



that corresponds to the velocity profile (5.36) in the low-velocity limit ( 5.33[ ) for 



a = {n'-l)W. (5.47) 



These results prove that the space-time transformation (5.39) generates from elec- 



tromagnetic waves in empty space solutions of Maxwell's equations for the vortex 



(5.34) in the limit of low velocities. 

However, although this solution describes the dominant features of the wave 
propagation through the vortex, it does not have the right topology in general; 
the solution is not periodic in the angle 0. For example, for a monochromatic 
wave of frequency u the phase grows by 27r a after completing one circle, which is 
not an integer multiple of 27r in general. If we take the coordinate transformation 



(5.38) literally, physical space has become multi- valued, with a branch cut in the 



direction of incidence. This branch cut describes the phase slip of light after passing 



through the vortex, but it cannot possibly be mathematically rigorous. Figure 17 



shows the correct solution due to Aharonov and Bohm [T| when additional scattering 



resolves the topological dilemma of the coordinate transformation (5.38). But this 
is a subject beyond the scope of this article, and so is hght propagation in rapidly 
spinning media [65] that describes how light may get sucked into the vortex. 

5.6 Analogue of the event horizon 

Transformation media are at the heart of invisibility devices, perfect lenses and the 
optical Aharonov-Bohm effect; here we explain that they also describe much of the 
essential physics at the event horizon of a black hole. In 1972 William Unruh |140j 
invented a simple analogy of the event horizon as an illustration for a colloquium at 
Oxford: black holes resemble rivers. Imagine a river is flowing towards a waterfall. 
Suppose that the river carries waves that propagate with speed c' relative to the 
water, but the water is rapidly moving with velocity u that at some point exceeds 
c'. Beyond this point, waves are swept away towards the waterfall. The point of no 
return is the horizon of the black hole. Imagine another situation: a river flowing out 
into the sea, getting slower. Waves coming from the sea are blocked at the line where 
the flow exceeds the wave speed |132l 1133] . Here the river estabhshes the analogue of 
the white hole, an object that nothing can enter. Such analogies occur also in many 
other situations and they illustrate some essential features of the event horizon, see 



Fig. [T8j Horizons are perfect traps: one would not notice anything suspicious while 
passing the horizon with the flow, but to return is impossible. In the other direction, 
trying to escape against the current, waves get stuck at the horizon; they freeze with 
dramatically shrinking wavelengths. These analogues are not mere analogies, they 
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Figure 17: Aharonov-Bohm effect. A ffuid vortex generates tlie optical Aliaronov-Bolim 



effect described by tlie coordinate transformation (5.38). Light, incident from the right 



is Fresnel-dragged by the moving medium: light propagating with the flow is advanced, 
whereas light propagating against the current is retarded. The wave should develop the 



phase slip shown in the left figure. However, although the transformation (5.38) is exact 



physical space-time would be described in multi-valued coordinates here. Instead of the 
simple phase slip, the light turns out to exhibit the characteristic interference pattern 
illustrated in the right figure jl]. 



are mathematically equivalent to wave propagation in general relativity (as long 
as higher-order dispersion is irrelevant [171 EU I138j ). For example, sound waves 
in moving fluids behave like waves in a certain space-time geometry that depends 
on the flow and the local velocity of sound |9T|, I137[ 1145] . Or, as we have already 
explained, light experiences a moving medium as the effective space-time geometry 



(5.23). At the horizon, where the flow u reaches the speed of light c/n, the measure 



of time goo in Gordon's metric (5.29) vanishes. Time comes to a standstill. Close to 



the horizon the wavelength is dramatically reduced (until due to optical dispersion 
the refractive index changes, tuning the light out of the grip of the horizon.) For 
short wavelengths the lateral dimensions of the horizon are irrelevant. Therefore, 
the optics at the horizon is essentially one-dimensional. 

Consider a one-dimensional model: both medium and light move along the z axis, 
the electromagnetic field vectors are pointing orthogonal to z in the x, y plane. In 
the following we show that this situation corresponds to a space-time transformation 
medium. We write down a coordinate transformation that maps one-dimensional 
wave propagation in moving media to propagation in empty electromagnetic space. 
How to find this transformation? In an empty one-dimensional space light would 
freely propagate with the speed c as a superposition of wavepackets that move 
either in positive or negative direction while maintaining their shapes. So these two 
wavepackets depend only on 



t 



± 



t'T 



(5.48) 



In physical space, the wavepackets do not propagate with the speed of light in 
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Figure 18: Aquatic analogue of tlie event liorizon. (From Ref. |25J. Reprinted with 
permission of Peter Hoey.) Current can stop fish moving upstream and mimic an event 
horizon. A moving optical medium captures the physics at the event horizon, too. A pulse 
in an optical fibre creates such a moving medium |106) . 



vacuum, but with the velocity v± that is given by the relativistic addition [6T] of 
the flow speed u and the speed of Hght ±c/n in positive or negative direction, 



t± = t- 



v± 



u± 



v± 



n 



1 ± 



u 



(5.49) 



cn 



So we expect that the one-dimensional wave propagation in the moving medium 
corresponds to waves in empty space by the transformation 



X 



X - 



y = y^ 



it- - 1, 



(5.50) 



To prove this assertion we apply the theory of transformation media. For u = 
(0, 0, u) the effective space-time geometry of the moving medium is described by the 
metric (5.29) with 
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-l)cu \ 



— 








— 



(5.51) 



We transform g"^^ to electromagnetic space. 



(5.52) 
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with the matrix A"'^ that we obtain by differentiating the new coordinates x"' with 
respect to the old coordinates x" of physical space 



A"' 



1 

1 

1 





in' 



-l)cu \ 



— n^n^ 




n{c^ — u^) 

g2 _ ^2^2 / 



(5.53) 



The result is the diagonal matrix 



diag 



with the inverse 



Qaip' = diag 



^2 _ ^2^2 ' 



-1,-1 



-1,-1 



and the determinant of the metric 



9 = - 



(5.54) 



(5.55) 



(5.56) 



n4(c2 -^2)2- 

The matrix gai^i describes the geometry in electromagnetic space. To find out 



how this geometry appears as a medium we use the constitutive equations (4.25) 



in electromagnetic space, with primed instead of unprimed metric tensors. Since 
g^'f^i is diagonal, the bi-anisotropy vector w' vanishes: in electromagnetic space the 
medium is at rest. We also obtain 



1. 



(5.57) 



Since electromagnetic waves propagating in the z direction are polarized in the x, y 
plane their electromagnetic fields only experience the x and y components of the 
dielectric tensors. Consequently, for one-dimensional wave propagation electromag- 
netic space is empty, waves are free here, whereas in physical space they appear as 



modulated wavepackets according to the transformations (5.49) and (5.50). 



We mapped one-dimensional electromagnetic waves in moving media onto waves 
in empty space, but what happens at the horizon? Why are horizons special? Sup- 
pose, without loss of generality, that the medium moves in the negative z direction 
and develops a black-hole horizon at z = 0: for positive z the flow speed \u\ lies 
below c/n and for negative z the flow exceeds c/n. For the waves that propagate 



against the current in the positive z direction the transformation (5.49) develops a 
pole at the horizon. We linearize u{z) here and obtain 



t - 



In Izl 



a 



a 



1 



n 



dw 

Tz 



(5.5^ 



For any given time t, the entire range of t+ is filled for z > or z < 0, for either 
side of the horizon. So electromagnetic space has become multi-valued and physical 
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space is cut into two distinct regions, because waves confined to either one of tfie 
two sides would never interact witli eacli otlier. But tliis is not entirely true [SI]. 
Picture a wavepacket that, after having barely escaped from the horizon, propagates 
in a region where the medium moves at uniform speed. The wavepacket is purely 
forward-propagating; so it consists of a superpostion of plane waves with purely 
positive wavenumbers k and corresponding positive frequencies u. On the other 
hand, recall the result from complex analysis pTj that Fourier transforms with positive 
spectra are analytic on the upper half plane (because integrals over k containing the 
analytic function exp(ifcz) converge and are continuous for positive k and positive 
Im^;). Consequently, if we analytically continue the wave packet on the complex z 
plane it should be analytic for Imz > 0, even if we trace its evolution back to the 
horizon. Analytic functions are continuous; so the escaping wavepacket must have 
partially originated from beyond the horizon — some part of the wave must have 
tunneled through. In order to calculate the tunnel amplitude we use the following 
trick [28]: we combine two waves exp(— iti;t+) localized on either side of the horizon 
such that they seamlessly form an analytic function on the upper half z plane. From 

exp ^— vr — \-i\n{—z)^ = exp (^—\ii{z)j (5.59) 

follows that we should give the waves on the left side of the horizon the prefactor 
exp(— TTcu/o;). Then the two waves combined make an analytic function. On the left 
side the medium moves faster than the speed of light in the medium. So here the 
phase of the wave moves backwards. In locally co-moving frames the wave would 
oscillate with negative frequencies. Such negative frequencies have recently been 
observed in water waves |114j . The negative- frequency wave makes a negative con- 
tribution to the energy; so we ought to subtract the modulus square of its amplitude 
in the total intensity of the wave (a technically more involved argument [681 1107j 
confirms this reasoning). Therefore the ratio of the tunneled to the total spectral 
intensity is 



exp I —2n — 

N = ^ ^ 

1 — exp —271— 
\ a 



expf — 1-1 



1 -1 



(5.60) 



According to the quantum field theory at horizons [TUl [ISl EHl 1107] , positive- and 
negative-frequency photon pairs are spontaneously created from the quantum vac- 
uum, because they do not cost any energy. They are emitted at the tunneling rate 



(5.60) that one can also interpret as the Planck spectrum [79 

huj 



N 



exp 



ksT 

with temperature {ks denotes Boltzmann's constant 



(5.61) 



ksT = —. (5.62) 
Black holes are not black after all. The horizon emits light with a Planck spectrum 



(5.61) and temperature (5.62). The light consists of entangled photon pairs where 
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each pair contains one photon with positive and one with negative frequency, one 
emitted outside and the other inside the horizon. Seen from one side of the hori- 
zon, the black hole is a black-body radiator. This flash of insight, due to Stephen 
Hawking in 1974 |51], has been one of the most influential predictions of modern the- 
oretical physics, because it illuminated a vastly unexplored intellectual landscape: 
Hawking's theory supports Bekenstein's idea |H] that the horizon carries entropy 
that is proportional to its area and it is a quantum phenomenon of a space-time 
geometry. Hawking's effect thus connects three vastly different areas of physics — 
thermodynamics, quantum mechanics and general relativity. In modern attempts 
for finding a quantum theory of gravity, such as loop quantum gravity [116J and 
superstring theory |12], the correct prediction of the Bekenstein-Hawking entropy 
has been used as a benchmark. However, the Hawking temperature of solar-mass 
black holes lies about eight orders of magnitude below the temperature of the cosmic 
microwave background; so, most probably, there is no chance of directly observing 
Hawking radiation in astrophysics. The benchmark of some of the most advanced 
theories of physics seems destined to remain theory. 

On the other hand, laboratory analogues of the event horizon may demonstrate 
the physics behind Hawking radiation [95], 11061 11391 1147] . For example, one could 
perhaps generate a detectable amount of Hawking radiation using few-cycle light 
pulses fibi [55] in photonic-crystal fibres jll8] . According to nonlinear fibre optics 
[3| the pulses behave, for all practical purposes, like one-dimensional moving media 
[7CT] 1106] : due to the optical Kerr effect they create additional contributions to 
the refractive index that move with the light pulses — media that move at the 
speed of light. Apart from optical dispersion [H], such moving media resemble the 
case considered in this article. How Hawking radiation emerges from elementary 
processes in gravity has remained largely a mystery, but laboratory analogues have 
a chance to, quite literally, shed hght on one of the most fascinating pieces of physics, 
the creation of light at horizons. 

6 Summary 

Optical media appear as geometries and geometries appear as media. Or, to be more 
precise, dielectric media appear to light as if they were changing the geometry of 
space (or of space-time if the media are moving or are bi-anisotropic). The modified 
and sometimes distorted geometry perceived by light is the cause of many optical 
illusions, including the ultimate illusion: invisibility [721 1104] . In order to deliber- 
ately create such an illusion one may design the desired geometry first and then 
construct a dielectric material for implementing it in practice. The simplest geome- 
tries are created by coordinate transformations where light propagation in physical 
space appears to be curved, but where, in a transformed space, light propagates 
along straight lines. Media that facilitate coordinate transformations are known as 
transformation media. 

Connections between geometry and optics have a long and distinguished history: 
Fermat's principle has inspired the principle of least action in classical mechanics, 
the connection between geometrical optics and wave optics motivated quantum me- 
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chanics and path integrals in quantum field theory and statistical mechanics, and 
the principle of least action inspired the idea of inertial motion along geodesies in 
general relativity. But transformation optics is probably the first case where ge- 
ometrical ideas that typically belong to the lofty sphere of general relativity have 
become practically useful in rather down-to-earth engineering applications. Both 
sides benefit from this connection: engineers get motivated to learn and appreciate 
the tools of geometry, because applications are a great incentive to learn, and applied 
mathematicians or relativists are confronted with and often delighted by the way in 
which these ideas appear in the real world, both in technical applications and also 
in some natural phenomena on Earth. Connections between theory and experiment 
and between applied and fundamental science have been the greatest strength of 
science. The subject of this article, transformation optics, is an example of such 
connections. We hope to have written it in a form such that both practical-minded 
engineers and theoretical-minded physicists and mathematicians find some treasure 
(and pleasure) in it. 

This article is a primer, an introduction to the geometry of light and the concepts 
of transformation optics. We focused on the principal ideas and introduced them 
without assuming much prior knowledge, only standard mathematics, some basic 
physics and sufficient stamina of the reader. Being a primer, the article necessarily 
omits many aspects of this field. We analyzed only four examples of transformation 
media — cloaking devices, perfect lenses, vortices and horizons — but these four 
cases both illustrate characteristic non-trivial topologies, each one with different 
physics, and they have been experimentally verified, at least to some extent. We did 
not discuss transformation media that re-scale space without changing the topology, 
see for example Refs. [56l I124j . nor did we describe intriguing ideas that have not 
been implemented yet such as electromagnetic wormholes [16]. We omitted the 
analysis of active cloaking devices [H3] or plasmonic coverings ^ EZ] , because they 
are not directly related to transformation media (but perhaps the latter may appear 
as transformation media in disguise). We also did not discuss cloaking devices that 
combine coordinate transformations with refractive-index profiles that cannot be 
transformed away, first ideas of non-Euclidian cloaking [73| [98] . Furthermore, we 
did not explain in detail how transformation media are implemented in practice, 
where the properties of metamaterials come from and how they are made, because 
this is a subject that fills entire books |Hni ESI 1120] . We focused on the main ideas 
and some connections between optics, in particular transformation optics, and other 
areas of physics and mathematics. It is quite remarkable how many different ideas 
optics combines and connects, but also here we were forced to make omissions. For 
example, we did not discuss applications of transformation media in acoustics, see 
for instance Refs. [HI 123, EH EH], and in quantum mechanics [72]. We also focused 
primarily on the classical optics of transformation media and only mentioned their 
quantum optics [681 EH 1107] without going into detail. Last, and least, we did 
not even attempt to represent the recent literature related to transformation optics, 
because this literature is too recent and rapidly growing — had we included it this 
article would be outdated in a very short time. We hope to have compensated for 
these shortcomings and omissions by being clear and pedagogical in the main ideas 



61 



and by focusing on the "new things in old things" and explaining "old things in new 
things" , by teUing the aspects of the story that we believe are already guaranteed 
to last and to remain inspiring for a long time. 
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Appendix A 



The physical principle underlying general relativity is that all the laws of physics 
should be expressible as tensor equations in four-dimensional (possibly curved) 
space-time pTj. As a consequence of the transformation properties of tensors, this 
means that the laws of physics will have the same form in any space-time coor- 
dinate system. This property of tensor equations is called general covariance. In 
this appendix we show that the free-space Maxwell equations in generally covariant 
form are equivalent to Maxwell's equations in a material medium with constitutive 



equations ( |4.25[ )-( |4.26[ ). 

The space-time form of Maxwell's equations is written in terms of the electro- 
magnetic field tensor F^j,, constructed from the the E and B fields; in a right-handed 
coordinate system 
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The generally covariant Maxwell equations are P7] 
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(A2) 



{p,f/c) is the four-current, and the square brackets denote antisym- 
This last operation produces a completely antisymmetric tensor with 



where = 
metrization. 

three indices. In Eq. (A2) -F[/x!/,a] is explicitly 

-^[/i!^,A] E^,y \ -\- Fp\^^ -\- F\^ y^ 

since the electromagnetic field tensor already is antisymmetric: F^^, = —Fy^. The 
forms of (A2) containing partial rather than covariant derivatives are obtained by 
using the antisymmetry of F^^ and F^^^ . In a left-handed coordinate system the 



(A3) 



relation between F^u and the magnetic field differs by a sign from that in Eq. (Al). 



In this Appendix we will confine ourselves to right-handed systems; this issue of 
handedness is dealt with in §4. We define a quantity H^" by 



H>^' = e.^gF^^ = soV^g^'g^'Exp 
1 



and regard H^^ as being constructed from D and H fields as follows: 
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Then, introducing a new four-current = y/—gj^, the free-space equations (A2) 
can be written as 

= 0, H^\, = (A7) 



which are the macroscopic Maxwell equations (4.5) in right-handed Cartesian coor- 
dinates. The constitutive equations are given by Eqs. (Al ) and (A4)-(A6). To obtain 
relations between the vector fields D, H and E, B consider first the components 

[3kl]gojgikHK 



Foi] from Eqs. (Al), (A5| and (|A6|) one obtains 
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We simplify this result as follows. The identity 
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Use of Eqs. (AlO) or (All) in Eq. (A12) produces the two relations 
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(A8) 
(A9) 

(AlO) 

(All) 
(A12) 

(A13) 



which reveal inverse-related 3x3 matrices. In view of Eqs. (A12) and (A13), mul- 
tiplying Eq. (A8) by and contracting on the index i yields 
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(A14) 



the first of the constitutive equations (4.25) with (4.26). 

To obtain the second constitutive relation, we employ the tensors dual to F^^, 
and H'^". This requires use of the 4D Levi-Civita tensor, which in a right-handed 
system is given by 
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The dual tensors *F^^^ and *H^y are defined by 
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so they have components 
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Re-expressed in terms of the dual tensors, the constitutive equations (A4)-([A5j) read 
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(A20) 

= -^=9^-^ *^Ap, (A21) 
£oV-9 

where we apphed the four- dimensional version of the double vector product (3.51) 



Writing out *Hoi using Eqs. (|A18j)-(jA20j) one finds 
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Comparison of this with Eqs. (A8) and (A14) shows that 
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Bi 



-9 



eoc^fl'oo 
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(A24) 



which is the second of the constitutive equations (4.25) with (4.26). 



In this way we derived Plebanski's constitutive equations (4.25)-(4.26) |112j . 
Note that several other relations between D, H, E and B are contained in Eqs. 
(A4)-(A5) and (A20)-(A21). For example, to express D and H in terms of E and 
B we need only take the time-space components of (A4), obtaining 

= eoV^ {9'' 9'' - 9'' 9'') Ej - eoc^[jkl]g''g'^Bi, (A25) 



and the required formulae are Eqs. (A23) and (A25). 
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